Blog

Why CatBoost Works So Well: The Engineering Behind the Magic

Gradient boosting is a cornerstone technique for modeling tabular data due to its speed and simplicity. It delivers great results without any fuss. When you look around, you’ll see multiple options such as LightGBM and XGBoost. CatBoost is one such variant. In this post, we will take a detailed look at this model, explore its inner […]

Time Series Forecasting Made Simple (Part 1): Decomposition and Baseline Models

I used to avoid time series analysis. Every time I took an online course, I’d see a module titled “Time Series Analysis” with subtopics like Fourier Transforms, autocorrelation functions and other intimidating terms. I don’t know why, but I always found a reason to avoid it. But here’s what I’ve learned: any complex topic becomes

Circuit Tracing: A Step Closer to Understanding Large Language Models

Over the years, Transformer-based large language models (LLMs) have made substantial progress across a wide range of tasks, evolving from simple information retrieval systems to sophisticated agents capable of coding, writing, conducting research, and much more. But despite their capabilities, these models are still largely black boxes. Given an input, they accomplish the task

Avoiding Costly Mistakes with Uncertainty Quantification for Algorithmic Home Valuations

When you’re about to buy a home, whether you’re an everyday buyer looking for your dream house or a seasoned property investor, there’s a good chance you’ve encountered automated valuation models, or AVMs. These clever tools use massive datasets filled with past property transactions to predict the value of your potential new home. By considering

Let’s Call a Spade a Spade: RDF and LPG — Cousins Who Should Learn to Live Together

In recent years, there has been a proliferation of articles, LinkedIn posts, and marketing materials presenting graph data models from different perspectives. This article will refrain from discussing specific products and instead focus solely on the comparison of RDF (Resource Description Framework) and LPG (Labelled Property Graph) data models. To clarify, there is no mutually
