Posts
All the articles I've posted.
-
GPU vs TPU
Updated:Decoding the Battle of AI Accelerators in 2025
-
Why Does Retrieval-Augmented Generation (RAG) Exist?
Updated:In the rapidly evolving world of artificial intelligence, large language models (LLMs) like GPT-4 or Grok have transformed how we interact with technology.
-
Understanding Tokenizers in AI — A Deep Dive into ChatGPT, Grok, and Gemini
Updated:A complete guide to tokenizers in modern LLMs, covering BPE, WordPiece, SentencePiece, Unigram, and how ChatGPT, Grok, and Gemini tokenize text. Includes examples, real-world impact, and why tokenization is the foundation of AI.
-
KV Cache Explained
Updated:A Deep Dive into Transformer Optimization