CI/CD for Machine Learning: It’s Not Just « Code + Data »

Hey everyone! It’s been a minute since my last post. Work has been absolutely crazy lately had a few tight deadlines that ate up all my free time. But I finally came up for air, and I wanted to document something I’ve been wrapping my head around recently. As an ai engineer, I thought I … Lire la suite CI/CD for Machine Learning: It’s Not Just « Code + Data »

Évaluez ceci :

The Road to LLM: What Does It Mean to Embed Words? [Day 4]

Hello everyone! We explored tokenizers—tools that divide text into units called tokens. In today’s post, we’ll dive into the concept of word embedding, using a machine learning model called Word2Vec, which focuses on word-level embeddings. What Is Word Embedding? Simply put, it looks like this: ID Word 1 cat 2 dog 3 bird 4 fox 5 tiger … Lire la suite The Road to LLM: What Does It Mean to Embed Words? [Day 4]

Évaluez ceci :

The Road to LLM: What is a Tokenizer? [Day 3]

Hello everyone! 👋 In our previous article, we explored Natural Language Processing (NLP) and how computers need to convert text into numbers (distributed representations) to understand human language 🗣️ ➡️ 🔢 Today, we’ll examine the crucial step that makes this conversion possible: tokenization! ✨ A tokenizer breaks down text into smaller pieces that can be … Lire la suite The Road to LLM: What is a Tokenizer? [Day 3]

Évaluez ceci :