altolending

Choose the Right One: Evaluating Topic Models for Business Intelligence

Topic models are used in businesses to classify brand-related text datasets (such as product and site reviews, surveys, and social media comments) and to track how customer satisfaction metrics change over time. There is a myriad of recent topic models one can choose from: the widely used BERTopic by Maarten Grootendorst (2022), the recent FASTopic

Choose the Right One: Evaluating Topic Models for Business Intelligence Read More »

Exporting MLflow Experiments from Restricted HPC Systems

Many High-Performance Computing (HPC) environments, especially in research and educational institutions, restrict communications to outbound TCP connections. Running a simple command-line ping or curl with the MLflow tracking URL on the HPC bash shell to check packet transfer can be successful. However, communication fails and times out while running jobs on nodes. This makes it

Exporting MLflow Experiments from Restricted HPC Systems Read More »

How to Benchmark DeepSeek-R1 Distilled Models on GPQA Using Ollama and OpenAI’s simple-evals

The recent launch of the DeepSeek-R1 model sent ripples across the global AI community. It delivered breakthroughs on par with the reasoning models from Meta and OpenAI, achieving this in a fraction of the time and at a significantly lower cost. Beyond the headlines and online buzz, how can we assess the model’s reasoning abilities

How to Benchmark DeepSeek-R1 Distilled Models on GPQA Using Ollama and OpenAI’s simple-evals Read More »

An Existential Crisis of a Veteran Researcher in the Age of Generative AI

I was a researcher fifteen years ago. A PhD candidate doing Research for long days. I was swamped with many articles, annotations, emails, bookmarks, etc. When I found a citation manager tool, Mendeley, I felt so relaxed. It was like I had control over the process again. When I found a bookmark manager, XBookmark, I

An Existential Crisis of a Veteran Researcher in the Age of Generative AI Read More »

Why Most Cyber Risk Models Fail Before They Begin

Cybersecurity leaders are being asked impossible questions. “What’s the likelihood of a breach this year?” “How much would it cost?” And “how much should we spend to stop it?” Yet most risk models used today are still built on guesswork, gut instinct, and colorful heatmaps, not data. In fact, PwC’s 2025 Global Digital Trust Insights

Why Most Cyber Risk Models Fail Before They Begin Read More »

Explained: How Does L1 Regularization Perform Feature Selection?

Feature Selection is the process of selecting an optimal subset of features from a given set of features; an optimal feature subset is the one which maximizes the performance of the model on the given task. Feature selection can be a manual or rather explicit process when performed with filter or wrapper methods. In these

Explained: How Does L1 Regularization Perform Feature Selection? Read More »

Enterprise AI: From Build-or-Buy to Partner-and-Grow

Not long ago, a cooperation partner casually approached me with an AI use case at their organization. They wanted to make their onboarding process for new staff more efficient by using AI to answer the repetitive questions of newcomers. I suggested a practical chat approach that would integrate their internal documentation, and off they went

Enterprise AI: From Build-or-Buy to Partner-and-Grow Read More »