Data Science – Page 2 – The AI Sector

A Simple Implementation of the Attention Mechanism from Scratch

Data ScienceApril 1, 202560Views 0Likes 0Comments

Introduction The Attention Mechanism is often associated with the transformer architecture, but it was already used in RNNs. In Machine Translation or MT (e.g., English-Italian) tasks, when you want to predict the next Italian word, you need your model to focus, or pay attention, on the most important English words that are useful to make…

Automate Supply Chain Analytics Workflows with AI Agents using n8n

Data ScienceMarch 27, 202571Views 0Likes 0Comments

Why build things the hard way when you can design them the smart way? As a Supply Chain Data Scientist, I’ve explored various frameworks like LangChain and LangGraph to build AI agents using Python. Leveraging LLMs with LangChain for Supply Chain Analytics — A Control Tower Powered by GPT — (Image by Samir Saci) The illustration above is from an article…

Evolving Product Operating Models in the Age of AI

Data ScienceMarch 22, 202567Views 0Likes 0Comments

previous article on organizing for AI (link), we looked at how the interplay between three key dimensions — ownership of outcomes, outsourcing of staff, and the geographical proximity of team members — can yield a variety of organizational archetypes for implementing strategic AI initiatives, each implying a different twist to the product operating model. Now…

Mastering Hadoop, Part 3: Hadoop Ecosystem: Get the most out of your cluster

Data ScienceMarch 17, 202558Views 0Likes 0Comments

As we have already seen with the basic components (Part 1, Part 2), the Hadoop ecosystem is constantly evolving and being optimized for new applications. As a result, various tools and technologies have developed over time that make Hadoop more powerful and even more widely applicable. As a result, it goes beyond the pure HDFS…

Mastering Hadoop, Part 1: Installation, Configuration, and Modern Big Data Strategies

Data ScienceMarch 12, 202569Views 0Likes 0Comments

Nowadays, a large amount of data is collected on the internet, which is why companies are faced with the challenge of being able to store, process, and analyze these volumes efficiently. Hadoop is an open-source framework from the Apache Software Foundation and has become one of the leading Big Data management technologies in recent years.…

How to Spot and Prevent Model Drift Before it Impacts Your Business

Data ScienceMarch 7, 202572Views 0Likes 0Comments

Despite the AI hype, many tech companies still rely heavily on machine learning to power critical applications, from personalized recommendations to fraud detection. I’ve seen firsthand how undetected drifts can result in significant costs — missed fraud detection, lost revenue, and suboptimal business outcomes, just to name a few. So, it’s crucial to have robust…

Vision Transformers (ViT) Explained: Are They Better Than CNNs?

Data ScienceMarch 2, 202572Views 0Likes 0Comments

1. Introduction Ever since the introduction of the self-attention mechanism, Transformers have been the top choice when it comes to Natural Language Processing (NLP) tasks. Self-attention-based models are highly parallelizable and require substantially fewer parameters, making them much more computationally efficient, less prone to overfitting, and easier to fine-tune for domain-specific tasks [1]. Furthermore, the…

Enhancing RAG: Beyond Vanilla Approaches

Data ScienceFebruary 25, 202570Views 0Likes 0Comments

Retrieval-Augmented Generation (RAG) is a powerful technique that enhances language models by incorporating external information retrieval mechanisms. While standard RAG implementations improve response relevance, they often struggle in complex retrieval scenarios. This article explores the limitations of a vanilla RAG setup and introduces advanced techniques to enhance its accuracy and efficiency. The Challenge with Vanilla…

Advanced Time Intelligence in DAX with Performance in Mind

Data ScienceFebruary 20, 202565Views 0Likes 0Comments

We all know the usual Time Intelligence function based on years, quarters, months, and days. But sometimes, we need to perform more exotic timer intelligence calculations. But we should not forget to consider performance while programming the measures. Introduction There are many Dax functions in Power BI for Time Intelligence Measures. The most common are: You…

How I Became A Machine Learning Engineer (No CS Degree, No Bootcamp)

Data ScienceFebruary 15, 202565Views 0Likes 0Comments

Machine learning and AI are among the most popular topics nowadays, especially within the tech space. I am fortunate enough to work and develop with these technologies every day as a machine learning engineer! In this article, I will walk you through my journey to becoming a machine learning engineer, shedding some light and advice…