My Brain Cells
🏷️ Tags
💻 Profile
profile_image
anthony
AI/ML Engineer
I am a software engineer with a passion for AI/ML.
🔎 Search
📂 All Posts
🦾 ML/AI
The Practical Guide to RAG: Types, Techniques, and How (and When) to Use Each

The Practical Guide to RAG: Types, Techniques, and How (and When) to Use Each

Nov 6, 2025

This guide gives you a field‑tested taxonomy of RAG types, the key techniques behind them, and a decision framework for when to use what—plus implementation recipes you can adapt today.

Rag
🦾 ML/AI
Guide to “RAY” by Anyscale

Guide to “RAY” by Anyscale

Nov 5, 2025

AI
🦾 ML/AI
8-Stage Lifecycle of Modern LLM Applications

8-Stage Lifecycle of Modern LLM Applications

Nov 4, 2025

LLMOps (Large Language Model Operations)

LLM
🦾 ML/AI
“Verl" for LLM Reinforcement Learning (Beyond Pre-training)

“Verl" for LLM Reinforcement Learning (Beyond Pre-training)

Oct 28, 2025

Volcano Engine Reinforcement Learning for LLMs

LLM
🦾 ML/AI
OLMo

OLMo

Oct 25, 2025

OLMo isn't just another model; it's a scientific artifact. It's built on a philosophy of being "fully open," providing the entire recipe, not just the finished cake.

LLM
🦾 ML/AI
DeepSpeed

DeepSpeed

Oct 24, 2025

DeepSpeed is an open-source deep learning optimization library for PyTorch, created by Microsoft.

AI
LLM
🦾 ML/AI
Finetune LLMs 2-5x Faster: An In-Depth Guide to Unsloth

Finetune LLMs 2-5x Faster: An In-Depth Guide to Unsloth

Oct 23, 2025

Unsloth is an open-source AI library designed to significantly speed up LLM finetuning and reduce memory usage.

AI
🦾 ML/AI
A Deep Dive into CrewAI for Collaborative AI Systems

A Deep Dive into CrewAI for Collaborative AI Systems

Oct 10, 2025

Beyond Single Agents - The Rise of Collaborative AI

Open Source
LLM
AI Agent
🦾 ML/AI
Beyond Linear Chains: A Deep Dive into “LangGraph” for Building Stateful AI Agents

Beyond Linear Chains: A Deep Dive into “LangGraph” for Building Stateful AI Agents

Oct 8, 2025

When Chains Aren't Enough

LLM
Open Source
AI
🦾 ML/AI
How to Evaluate an LLM (Accuracy, Performance & Latency)

How to Evaluate an LLM (Accuracy, Performance & Latency)

Oct 5, 2025

Evaluating a large language model (LLM) isn’t one metric—it’s a multi‑dimensional scorecard. This guide shows you how to build a practical, repeatable evaluation for accuracy, performance, and latency

LLM
AI
🦾 ML/AI
Evaluate RAG

Evaluate RAG

Oct 4, 2025

RAG systems aren't foolproof—they can suffer from poor retrieval of irrelevant documents or hallucinations in generated outputs.

LLM
🦾 ML/AI
Reinforcement Learning from Human Feedback (RLHF)

Reinforcement Learning from Human Feedback (RLHF)

Oct 3, 2025

Reinforcement Learning from Human Feedback (RLHF) is a technique that uses human preferences as a guide for training models – essentially using reinforcement learning (RL) to directly optimize a model’s behavior based on what humans find desirable.

LLM
AI
🦾 ML/AI
A Beginner's Guide to LangChain

A Beginner's Guide to LangChain

Oct 2, 2025

LangChain is a powerful framework for building applications powered by LLMs.

AI
Machine Learning
🦾 ML/AI
The Deep Dive into LoRA

The Deep Dive into LoRA

Oct 2, 2025

LoRa: Low-Rank Adaptation

LLM
🦾 ML/AI
NVIDIA Inference Microservices (NIMs)

NVIDIA Inference Microservices (NIMs)

Sep 27, 2025

NIM is a pre-built, optimized, and containerized AI model that you can run with a single command.

LLM
AI
Software Engineering
🦾 ML/AI
NVIDIA Triton Streamlines Your Path to Production

NVIDIA Triton Streamlines Your Path to Production

Sep 26, 2025

Open Source
AI
🤖 Computer Science
The CUDA: From Foundational Principles to High-Performance Parallel Computing

The CUDA: From Foundational Principles to High-Performance Parallel Computing

Sep 25, 2025

This foundational section establishes the historical and conceptual context for CUDA, explaining why it was created and the fundamental problem it solves. We will move from the general concept of GPU computing to the specific architecture of a CUDA-powered application.

Programming
🦾 ML/AI
SGLang: The Engine That's Redefining High-Performance LLM Programming

SGLang: The Engine That's Redefining High-Performance LLM Programming

Sep 25, 2025

Fast Inference engine with low latency

Open Source
Machine Learning
AI
🦾 ML/AI
Deep Dive into vLLM

Deep Dive into vLLM

Sep 6, 2025

The conceptual innovations of PagedAttention and Continuous Batching are realized through a well-engineered, modular software architecture designed for performance, extensibility, and production readiness.19 Understanding the key components of the vLLM engine reveals a system built not just for speed, but also for scalability and maintainability.

Open Source
AI
🦾 ML/AI
Generate High-Quality Synthetic Data 📊 for ML/DL & GenAI Projects

Generate High-Quality Synthetic Data 📊 for ML/DL & GenAI Projects

Sep 5, 2025

Synthetic data helps you move faster, protect privacy, balance classes, and stress-test edge cases.

Machine Learning
🦾 ML/AI
LegalLLM 👩🏼‍💼: Revolutionizing Legal Analytics with AI

LegalLLM 👩🏼‍💼: Revolutionizing Legal Analytics with AI

Sep 4, 2025

LegalLLM, a multi-task Large Language Model designed specifically for the complexities of U.S. legal analytics.

AI
🦾 ML/AI
Deep Dive into NVIDIA TensorRT with PyTorch and ONNX

Deep Dive into NVIDIA TensorRT with PyTorch and ONNX

Sep 2, 2025

Deep Dive into NVIDIA TensorRT with PyTorch and ONNX

Open Source
LLM
Machine Learning
AI
🤖 Computer Science
System Design Interview: Design Instagram 📸

System Design Interview: Design Instagram 📸

Aug 18, 2025

We’ll cover requirements, architecture, scaling challenges, data modeling, caching, APIs, and more — with diagrams to keep things visual.

System Design
📗 Docs
VS Code Keyboard Shortcuts Every Developer Should Know

VS Code Keyboard Shortcuts Every Developer Should Know

Jun 1, 2025

Visual Studio Code (VS Code) has become one of the most popular code editors thanks to its powerful features and extensibility. However, mastering VS Code goes beyond installing extensions and tweaking settings — it’s about efficiency. Keyboard shortcuts can dramatically speed up your development workflow, reduce context switching, and keep your hands on the keyboard. This post covers essential VS Code shortcuts every developer should know, organized by categories and tailored for both Windows/Linux and macOS users.

Productivity
🤖 Computer Science
Big O Cheat Sheet

Big O Cheat Sheet

May 31, 2025

Understanding algorithmic complexity is crucial for writing efficient code and making informed decisions about data structures and algorithms. This cheat sheet covers the fundamentals of Big O notation, common time and space complexities, and real-world examples with code snippets.

📊 Data Science
Time Series Forecasting Methods in Data Science

Time Series Forecasting Methods in Data Science

May 31, 2025

Time series forecasting is a cornerstone of data science, enabling practitioners to predict future values based on historically observed data points. From stock prices and weather patterns to website traffic and sensor readings, accurate forecasts inform strategic decisions across industries. In this post, we'll explore a range of methods—from classic statistical techniques to modern machine learning and deep learning approaches—and demonstrate how to implement them in Python.

Machine Learning
Data Science
🤖 Computer Science
PyCaret Guide

PyCaret Guide

May 30, 2025

PyCaret is an open-source, low-code machine learning library in Python that automates and streamlines the entire ML workflow

Open Source
automation
Machine Learning
🤖 Computer Science
Scraping Images from the Web Using Selenium

Scraping Images from the Web Using Selenium

May 30, 2025

Scraping Images from the Web Using Selenium. It covers everything from environment setup to running a robust scraper, with code snippets and best practices.

automation
web
Productivity
🤖 Computer Science
System Design: Complete Guide for Interviews

System Design: Complete Guide for Interviews

May 30, 2025

System design interviews assess your ability to architect large-scale services. Unlike algorithm questions, they focus on high-level thinking: gathering requirements, making trade-offs, and balancing non-functional needs (scale, reliability, maintainability). In this guide you’ll learn a repeatable framework, key building blocks, common patterns, and a worked example.

System Design
Interview
Software Engineering
🦾 ML/AI
The Ultimate Cheat Sheet: Picking the Right Model, Optimizer & LR for Every Scenario

The Ultimate Cheat Sheet: Picking the Right Model, Optimizer & LR for Every Scenario

May 29, 2025

In supervised-learning, unsupervised-learning, time-series, deep-learning and reinforcement-learning tasks, each modeling problem brings its own “sweet-spot” of algorithms, solvers/optimizers, and hyperparameter defaults. Below is a practical guide to choosing models, optimizers (or solvers), learning‐rate heuristics and when to reach for each technique.

Machine Learning
📗 Docs
Github Complete Guide

Github Complete Guide

May 29, 2025

GitHub has become the de facto platform for hosting, collaborating on, and managing software projects. Whether you’re an individual developer or part of a large team, understanding how to leverage GitHub effectively can dramatically improve your workflow. In this “Complete Guide,” we’ll walk through everything from the basics of Git and repositories, to advanced features like pull requests, GitHub Actions, and best practices for collaboration.

Github
📗 Docs
Web Scraping using Selenium guide

Web Scraping using Selenium guide

May 28, 2025

Web scraping is the automated process of extracting information from websites. While simple HTTP requests and HTML parsing libraries (like requests and BeautifulSoup) work for many static sites, dynamic pages driven by JavaScript require a browser-like environment. That’s where Selenium comes in: a powerful browser-automation tool that can drive a real (or headless) browser to render pages, interact with elements, and retrieve the fully generated HTML.

web
automation
Scarping
🦾 ML/AI
How to run Streamlit in google colab

How to run Streamlit in google colab

May 17, 2025

Streamlit is a Python library for building web apps. Running apps locally depends on your system’s configuration and compatibility, which can sometimes be a hurdle.

Machine Learning
Streamlit
Open Source
🦾 ML/AI
Personal Finance Analysis with Local LLMs

Personal Finance Analysis with Local LLMs

May 11, 2025

LLM
Git
📕 Book
Think and Grow Rich

Think and Grow Rich

May 6, 2025

10 Lessons from the book

Docs
💻 Profile
anthony
AI/ML Engineer
I am a software engineer with a passion for AI/ML.
🌟 Service
💬 Contact
github
instagram
email
linkedin