Welcome to

Hamed's Webpage

Machine Learning and AI

Hamed Firooz

I have over 12 years of experience shaping and delivering large-scale AI solutions across a range of products, with a track record of developing and leading multi-year technology strategies. I also have over 7 years of experience managing research and engineering teams across multiple sites.

  • Current position: Principal Staff AI Scientist at LinkedIn Core AI
  • Education: PhD from University of Washington (UW)

Education

  • 2012
    PhD
    University of Washington

    Compressed Sensing and Network Coding

  • 2008
    MSc
    University of Tehran

    Peer-to-peer networks

Experience

  • 2023 - Current
    Principal Staff AI Scientist
    LinkedIn Core AI

    I formed and currently lead a team of over 20 AI scientists and engineers that trains and operationalizes an LLM-based foundation model for LinkedIn’s personalization tasks at scale.

  • 2018 - 2023
    Sr. Staff AI Tech Lead Manager
    Meta AI

    Led a medium-sized team of research scientists and software engineers with diverse profiles. Our mission was to advance AI technologies that keep users safe online. My team built multimodal content understanding services used across many Meta integrity products.

  • 2016 - 2018
    Staff Machine Learning Engineer
    LinkedIn

    Led the LinkedIn Ads Sponsored Update relevance team (five engineers, one analyst, one PM). The team was responsible for modeling and ranking advertising content on the LinkedIn news feed, serving ads from millions of advertisers to hundreds of millions of LinkedIn daily active users.

  • 2015 - 2016
    Staff Machine Learning Tech lead Manager
    Base CRM (acquired by Zendesk)

    Led a four-engineer forecasting group responsible for a) predicting sales attributes (dollar amount, close date, and closing probability) for the Sales team and b) predicting churn likelihood for the Customer Success (CSM) team (media coverage).

  • 2012 - 2015
    Senior Machine Learning Engineer
    Falkonry (acquired by IFS)

    Built an early-warning system based on a Bayesian network that provides diagnosis and prognosis for large industrial machines.

Highlights
  • 2024
  • [Oct 2024] We published our findings on the LLM Lost-in-Distance phenomenon.

    In this work, we demonstrate that LLM performance is affected by the relative distance between relevant pieces of information in the context: the further apart the information sits within a long context, the more the model’s performance deteriorates.

  • [Aug 2024] My team open-sourced Liger Kernel for memory-efficient and fast LLM training (a usage sketch follows this list).

    Liger Kernel is a collection of Triton kernels designed specifically for LLM training. It can increase multi-GPU training throughput by 20% and reduce memory usage by 60%.

  • [May 2024] Enhancing Stability for Large Language Models Training in Constrained Bandwidth Networks was accepted to the ICML'24 FoMo-ES workshop.

    This system-model co-design work focuses on synchronization in hierarchically partitioned data parallelism to avoid race conditions in gradient updates during LLM training.

  • [Mar 2024] RESPROMPT: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models was accepted to NAACL'24.

    We formulate chain-of-thought (CoT) as a reasoning graph and propose a prompting strategy for multi-step reasoning that captures the complex processes in tasks such as mathematics and commonsense reasoning.

  • [Feb 2024] My team contributed to the open-source DeepSpeed implementation of ZeRO++ hierarchical partitioning (a configuration sketch follows this list).

    A race condition between AllGather and the device-to-device copy for the secondary partition caused instability when training large models such as Llama-7B and Falcon-40B on a moderately large number of GPUs. After discovering the algorithmic issue, we landed a fix in the DeepSpeed repository.

  • 2023
  • [Sep 2023] Our paper Understanding the detrimental class-level effects of data augmentation was accepted to NeurIPS 2023.

    We propose a framework for understanding how data augmentation interacts with class-level learning dynamics. We show that simple class-conditional augmentation strategies informed by our framework improve performance on the negatively affected classes.

  • WIP
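
For the Liger Kernel item above, here is a minimal usage sketch in Python. It assumes the open-source liger-kernel package together with Hugging Face transformers; the apply_liger_kernel_to_llama patching call follows that package's documented pattern, and the checkpoint name is a placeholder rather than anything from this page.

    # Minimal sketch: patch a Hugging Face Llama model with Liger's Triton kernels
    # before training (assumes `pip install liger-kernel transformers` and a GPU).
    import torch
    from liger_kernel.transformers import apply_liger_kernel_to_llama
    from transformers import AutoModelForCausalLM

    # Patch the Hugging Face Llama modules (RMSNorm, RoPE, SwiGLU, cross-entropy, ...)
    # with fused Triton kernels; call this before instantiating the model so the
    # patched classes are used.
    apply_liger_kernel_to_llama()

    model = AutoModelForCausalLM.from_pretrained(
        "meta-llama/Llama-2-7b-hf",  # placeholder checkpoint
        torch_dtype=torch.bfloat16,
    )
    # ...continue with the usual training loop or Trainer setup.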
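
And for the ZeRO++ hierarchical partitioning item, a hedged configuration sketch, based on DeepSpeed's public ZeRO++ options; the partition size, batch size, and the commented-out model wiring are placeholders, not values taken from this page.

    # Sketch: enabling ZeRO stage 3 with ZeRO++ hierarchical (secondary) partitioning
    # in DeepSpeed. zero_hpz_partition_size is typically set to the GPUs per node.
    import deepspeed

    ds_config = {
        "train_micro_batch_size_per_gpu": 1,  # placeholder
        "bf16": {"enabled": True},
        "zero_optimization": {
            "stage": 3,
            "zero_hpz_partition_size": 8,      # placeholder: GPUs per node
            "zero_quantized_weights": True,
            "zero_quantized_gradients": True,
        },
    }

    # model comes from the surrounding training script, e.g.:
    # engine, optimizer, _, _ = deepspeed.initialize(
    #     model=model, model_parameters=model.parameters(), config=ds_config
    # )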
Contact