Reinforcement Learning Alignment

✨ Coming Soon: Adversarial RL for LLM, Specultive Decoding, Triton Kernels, and more ...

A glimpse into the research directions I'm actively exploring. From accelerating LLMs to building multi-modal intelligent agents. This is what's coming next.

Aug 15, 2025