Some Resources for Reinforcement Learning Basics

Reinforcement learning (RL) has surged in popularity over the past few months, largely thanks to Large Reasoning Models (LRMs) and test‑time scaling techniques. A solid understanding of RL fundamentals often makes the difference between a model that merely trains and one that converges stably and efficiently when applied to LLMs. To refresh my own knowledge, I’ve been revisiting …

My Long-Running Notebook of AI & LLM

“I never start a project without skimming my own notes first.” - Every engineer, the night before a deadline

I’ve been writing down my AI/LLM notes over the last 6 months. What started as a few scrappy records about Transformer quirks has turned into 75 pages of tips that span everything from model tweaks to prod inference …