Models for Search Ranking

Simple Approaches to Search Ranking: In the world of search, ranking models are crucial. They determine the order in which results are displayed, impacting user experience and engagement. But does ranking always have to be complex? The answer might surprise you. 1. Understanding Ranking Models: Ranking models are algorithms used by […]

Various Search Relevance Algorithms

Various search relevance algorithms have been developed over the years to improve the quality of search results. Some of these methods are foundational, while others are cutting-edge and have arisen from advancements in machine learning and natural language processing. Here’s a list of some popular search relevance algorithms and methods: […]

What is BM25f

BM25f is an extension of the BM25 scoring function, which is a part of the family of ranking functions used in information retrieval. BM25 itself is a modern alternative to the classic TF-IDF scheme, designed to rank documents based on their relevance to a given query. Here’s a breakdown of […]
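As a rough illustration of the base BM25 function that BM25f extends, here is a minimal Python sketch. It scores plain single-field documents only, not the field-weighted variant. The function name `bm25_scores`, the defaults `k1=1.2` and `b=0.75`, the smoothed IDF form, and the toy documents are all assumptions made for this example.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document in `docs` against the tokenized `query`.

    k1 controls term-frequency saturation and b controls length
    normalization; both defaults are common choices, assumed here.
    """
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: how many documents contain each term.
    df = Counter(term for d in docs for term in set(d))

    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Smoothed IDF that stays positive (one common variant, assumed).
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturates and is normalized by document length.
            denom = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / denom
        scores.append(score)
    return scores


# Hypothetical toy corpus, tokenized by whitespace.
docs = [
    "the quick brown fox".split(),
    "the lazy dog".split(),
    "quick quick fox jumps over the lazy dog".split(),
]
scores = bm25_scores(["quick", "fox"], docs)
```

The second document contains neither query term, so it scores zero. The first document outranks the third even though the third repeats "quick", because length normalization penalizes the longer document.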

What is TF-IDF

TF-IDF stands for Term Frequency-Inverse Document Frequency. It’s a numerical statistic used to indicate the importance of a word in a document relative to a collection of documents, often called a corpus. TF-IDF is commonly used in the field of information retrieval and text mining. Here’s a breakdown: Why is […]
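To make the definition concrete, here is a minimal Python sketch of one common TF-IDF formulation: term frequency as the count divided by document length, and inverse document frequency as log(N / df). Other weighting variants exist. The function name `tf_idf` and the toy corpus are assumptions made for this example.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document in `docs`."""
    n_docs = len(docs)
    # Document frequency: how many documents contain each term.
    df = Counter(term for d in docs for term in set(d))

    weights = []
    for d in docs:
        counts = Counter(d)
        weights.append({
            # TF (relative frequency in this document) times IDF (rarity in the corpus).
            term: (count / len(d)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical toy corpus, tokenized by whitespace.
corpus = [
    "search ranking with tf idf".split(),
    "search relevance and ranking models".split(),
    "installing scala on centos".split(),
]
weights = tf_idf(corpus)
```

In the first document, "tf" appears in only one document of the corpus and so gets a higher weight than "search", which appears in two. A term that occurs in every document gets a weight of zero under this formulation.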

Google Cloud Functions

Google Cloud Functions are serverless code functions that run without you having to manage or scale the underlying infrastructure. This makes building them really easy. So let’s build an example. Here’s a normal Node.js function with two parameters: request and response. The incoming request is automatically parsed for JSON body […]

About Scalding

Scalding is a Scala library that makes it easy to work with and reason about data in distributed systems like Hadoop. It presents the data as a collection and lets you perform computations on it in a manner similar to the Scala API, so it appears to the […]

Top 10 Big Data Trends for 2017

Tableau published a paper on the Top 10 Big Data Trends for 2017 that you can find here: We disagree that speeding up Hadoop is the number 1 trend. The author takes an evolutionary approach rather than a revolutionary one. What is needed more and more is event-driven processing. The machinery […]

Installing Scala 2.12 on CentOS

wget
tar xvf scala-2.12.0.tgz
sudo mv scala-2.12.0 /usr/lib
sudo ln -s /usr/lib/scala-2.12.0 /usr/lib/scala
export PATH=$PATH:/usr/lib/scala/bin
scala -version

Code Musing

“I am sorry I have had to write you such a long letter, but I did not have time to write you a short one” Pascal, Blaise (1623 – 1662), French philosopher and mathematician. At the age of 18 he invented the first calculating machine. So I wonder why […]

What is new in SOLR 6.x

Solr 6 builds on the innovations of Solr 5. First of all, let’s take a look at what was done in Solr 5. There were improvements to “bin/solr” and “bin/post” that made it easier to start up Solr and add new documents, and more APIs were introduced. The user interface was rewritten in […]