Lead ML Engineer · Author · Mentor
Lead ML Engineer @ Adobe — building intelligent systems at scale.
Author of 2 published ML books. Bengaluru, India.
About
I'm a Lead ML Engineer at Adobe with over 13 years in the industry, building systems where machine learning meets real-world scale. My career spans TCS, HackerEarth, Nineleaps, Yellow.ai, and now Adobe — each chapter sharpening my focus on where systems meet intelligence.
I've authored two published books — Practical Machine Learning with Rust (Apress) and fastText Quick Start Guide (Packt) — and I write regularly on Medium about LLMs, ML system design, and the math behind modern deep learning.
Currently focused on LLM inference optimization, Flash Attention, quantization, and large-scale NLP pipelines. Available for 1:1 mentorship on Topmate.
Current Focus
Currently at
Adobe
Lead ML Engineer
⬤ ActiveQuick Stats
Writing
A deep dive into Flash Attention — the algorithm that made training large transformers feasible by rethinking how attention is computed in memory.
Read on MediumExploring what the Vera Rubin Observatory teaches us about scale, data, and the infrastructure choices that will define the next generation of AI systems.
Read on MediumBreaking down Denoising Diffusion Probabilistic Models from first principles — the forward process, reverse process, and the loss function that ties it together.
Read on MediumA practical overview of quantization strategies for large language models — INT8, GPTQ, AWQ — and how each affects inference speed and model quality in production.
Read on MediumML system design walkthrough for a scalable harmful ad detection pipeline — covering data flows, model selection, and trade-offs at production scale.
Read on MediumEnd-to-end ML system design for ranking comments at scale — feature engineering, model architecture, online/offline evaluation, and serving infrastructure.
Read on MediumShrinking AI models from feast to fit — a clear explanation of quantization fundamentals, why it matters, and how it reduces memory and inference cost without sacrificing accuracy.
Read on MediumYouTube
2:32:57
Deriving Flash Attention: The Math, the Hardware, and the Triton Implementation
51:43
The MATH behind Diffusion Models DDPM
59:04
LLM Quantization Techniques Explained — GPTQ AWQ GGUF HQQ BitNet
31:23
LLM Quantization Explained
18:12
The AI Breakthrough Nobody Expected: Inside DeepSeek R1
35:40
Easily Finetune Llama 3.2 for Your Use Case | Upload to Ollama | Unsloth
24:02
Meta's LLaMA 3.2: Big Leap into Multimodal AI | RUN LOCALLY
23:14
GraphRAG from Microsoft. Is it good? Implementation with Ollama + Llama 3.1
16:34
MultiAgent AI System using CrewAI + Llama 3.1
Published Work
My first book bridges two worlds I love: Rust's performance guarantees and the richness of the ML ecosystem. It covers supervised, unsupervised, and reinforcement learning, computer vision, NLP, and deployment — all in Rust. Written for engineers who want memory-safe, high-performance ML beyond Python.
A practical guide to Facebook's fastText — the library that made fast, scalable text classification accessible to everyone. Covers word representations, text classification, integration with Keras, TensorFlow, and PyTorch, and mobile deployment. A great entry point for applied NLP.
Open Source
practical-machine-learning-w-rust
Source code and examples for Practical Machine Learning with Rust — ML algorithms, computer vision, NLP, and deployment, all in Rust.
fastText-Quick-Start-Guide
Code repository for the fastText Quick Start Guide — hands-on examples for text classification, word embeddings, and ML framework integrations.
94 repositories and counting
View all reposCareer
2022 → Present
Adobe
Lead ML Engineer
⬤ CurrentBuilding and scaling ML systems at Adobe — focused on LLM inference, NLP pipelines, and intelligent content understanding. Research-to-production work with emphasis on performance, reliability, and scale.
Jan 2021 → 2022
Yellow.ai
Engineering Manager — NLP
Led the NLP infrastructure team building and deploying thousands of conversational AI models at scale. Architected multi-lingual, multi-modal NLP pipelines from research through Kubernetes-based production serving.
May 2017 → Dec 2020
Nineleaps
Team Lead — ML Platform
Grew from individual contributor to technical lead on the ML platform — building production data pipelines, ML models, and leading the team through architecture reviews and mentorship.
Nov 2016 → May 2017
HackerEarth
Category Head — Python
Built the Python practice section for B2C developer engagement, creating content and tooling for a high-engagement coding education platform.
2011 → 2016
TCS & SLK
Systems Engineer
Early career in systems engineering at Tata Consultancy Services and SLK Software — building automation tools, configuring J2EE applications, and developing foundational software engineering instincts.
Expertise
ML / DL Frameworks
Languages
Infrastructure
Specializations
Mentorship
I offer 1:1 mentorship for engineers looking to grow in ML, navigate career transitions, or sharpen their system design skills. Whether you're preparing for senior interviews or shipping your first production model, I've been there.
Career guidance, technical deep-dives, interview prep, and more — on your schedule.
View availability on Topmate