technical notes – Hamed's Blog

Some Resources for Reinforcement Learning Basics

July 30, 2025 by hamed

Reinforcement learning (RL) has surged in popularity over the past few months, largely thanks to Large Reasoning Models (LRMs) and test‑time scaling techniques. A solid understanding of RL fundamentals often makes the difference between a model that merely trains and one that converges stably and efficiently when applied to LLMs. To refresh my own knowledge, I’ve been revisiting …

My Long-Running Notebook of AI & LLM

July 19, 2025 / July 6, 2025 by hamed

“I never start a project without skimming my own notes first.” - Every engineer the night before a deadline

I’ve been writing down my AI/LLM notes over the last 6 months. What started as a few scrappy records about Transformer quirks has turned into 75 pages of tips that span everything from model tweaks to prod inference …