Blog

Your blog category

NumExpr: The “Faster than Numpy” Library Most Data Scientists Have Never Used

Browsing GitHub the other day, I came across a library I’d never heard of before. It was called NumExpr. I was immediately interested because of some claims made about the library. In particular, it stated that for some complex numerical calculations, it was up to 15 times faster than NumPy.  I was intrigued because, up […]

NumExpr: The “Faster than Numpy” Library Most Data Scientists Have Never Used Read More »

Behind the Magic: How Tensors Drive Transformers

Introduction Transformers have changed the way artificial intelligence works, especially in understanding language and learning from data. At the core of these models are tensors (a generalized type of mathematical matrices that help process information) . As data moves through the different parts of a Transformer, these tensors are subject to different transformations that help

Behind the Magic: How Tensors Drive Transformers Read More »

LLM Evaluations: from Prototype to Production

Evaluation is the cornerstone of any machine learning product. Investing in quality measurement delivers significant returns. Let’s explore the potential business benefits. As management consultant and writer Peter Drucker once said, “If you can’t measure it, you can’t improve it.” Building a robust evaluation system helps you identify areas for improvement and take meaningful actions

LLM Evaluations: from Prototype to Production Read More »

Choose the Right One: Evaluating Topic Models for Business Intelligence

Topic models are used in businesses to classify brand-related text datasets (such as product and site reviews, surveys, and social media comments) and to track how customer satisfaction metrics change over time. There is a myriad of recent topic models one can choose from: the widely used BERTopic by Maarten Grootendorst (2022), the recent FASTopic

Choose the Right One: Evaluating Topic Models for Business Intelligence Read More »

Exporting MLflow Experiments from Restricted HPC Systems

Many High-Performance Computing (HPC) environments, especially in research and educational institutions, restrict communications to outbound TCP connections. Running a simple command-line ping or curl with the MLflow tracking URL on the HPC bash shell to check packet transfer can be successful. However, communication fails and times out while running jobs on nodes. This makes it

Exporting MLflow Experiments from Restricted HPC Systems Read More »

How to Benchmark DeepSeek-R1 Distilled Models on GPQA Using Ollama and OpenAI’s simple-evals

The recent launch of the DeepSeek-R1 model sent ripples across the global AI community. It delivered breakthroughs on par with the reasoning models from Meta and OpenAI, achieving this in a fraction of the time and at a significantly lower cost. Beyond the headlines and online buzz, how can we assess the model’s reasoning abilities

How to Benchmark DeepSeek-R1 Distilled Models on GPQA Using Ollama and OpenAI’s simple-evals Read More »