about_me
Hi! I'm Sumit! I'm a software engineer (day job) and an independent machine learning researcher. I have a strong desire to contribute to the field of ML research & development to make the world a better place. You can learn more about my research interests in my research site if you're interested. But regardless, welcome!
You can learn more about me here where I go into further detail about my work and interests. Only if you're interested, of course.
If you'd like to get in touch, you can find me on LinkedIn or send me an email. I love talking over coffee, so I'm always up for a cup and chat if you're around the Tokyo area.
current_focus
| Studying | Transformer, GRPO, and more SOTA building blocks | ongoing |
| Writing | Technical articles on Machine Learning | in-progress |
| Learning by doing | Mini-projects and experiments with SOTA models | active |
featured_blogs
featured_projects
Emergence in Mixture of Experts
Emergence of expertise across different domains when replacing a dense FFN with a MoE.
Jan 29, 2026
Seq2Seq: From Scratch to Training & Inference
Implementation of the original Seq2Seq architecture from scratch, with training and inference pipelines.
Aug 10, 2025

