Lead ML Engineer · Author · Mentor

Joydeep
Bhattacharjee

Lead ML Engineer @ Adobe — building intelligent systems at scale.
Author of 2 published ML books. Bengaluru, India.

2 Published Books
670+ Medium Followers
94 GitHub Repos
13+ Years in Tech
Joydeep Bhattacharjee

Engineer, author,
and lifelong learner.

I'm a Lead Machine Learning Engineer at Adobe with over 13 years in the industry, building systems where machine learning meets real-world scale. My career spans TCS, SLK, HackerEarth, Nineleaps, Yellow.ai, Samsung Semiconductor, and now Adobe — each chapter sharpening my focus on where systems meet intelligence.


I've authored two published books — Practical Machine Learning with Rust (Apress) and fastText Quick Start Guide (Packt) — and I write regularly on Medium about LLMs, ML system design, and the math behind modern deep learning.


Currently building core GenAI services and APIs on Adobe Firefly — focused on diffusion models, transformers, GPU-accelerated inference pipelines, and optimizing latency and throughput at scale. Available for 1:1 mentorship on Topmate.

Current Focus

GenAI Services Diffusion Models GPU Optimization LLM Inference Flash Attention Quantization ML System Design

Currently at

Adobe

Lead ML Engineer

⬤ Active

Quick Stats

GitHub Stars 133+
Published Books 2
Medium Followers 670+
Open Source Repos 94

Recent Articles

March 2026

Understanding Flash Attention

A deep dive into Flash Attention — the algorithm that made training large transformers feasible by rethinking how attention is computed in memory.

Read on Medium
March 2026

Vera Rubin and the Future of AI

Exploring what the Vera Rubin Observatory teaches us about scale, data, and the infrastructure choices that will define the next generation of AI systems.

Read on Medium
September 2025

The Math Behind Diffusion Models — DDPM

Breaking down Denoising Diffusion Probabilistic Models from first principles — the forward process, reverse process, and the loss function that ties it together.

Read on Medium
August 2025

LLM Quantization Techniques

A practical overview of quantization strategies for large language models — INT8, GPTQ, AWQ — and how each affects inference speed and model quality in production.

Read on Medium
July 2025

Design: Harmful Ad Detection System

ML system design walkthrough for a scalable harmful ad detection pipeline — covering data flows, model selection, and trade-offs at production scale.

Read on Medium
June 2025

Design: Comment Ranking System

End-to-end ML system design for ranking comments at scale — feature engineering, model architecture, online/offline evaluation, and serving infrastructure.

Read on Medium
2024

LLM Quantization Explained

Shrinking AI models from feast to fit — a clear explanation of quantization fundamentals, why it matters, and how it reduces memory and inference cost without sacrificing accuracy.

Read on Medium

Videos

Books

Practical Machine Learning with Rust

Practical Machine Learning with Rust

83 stars 27 forks Apress

My first book bridges two worlds I love: Rust's performance guarantees and the richness of the ML ecosystem. It covers supervised, unsupervised, and reinforcement learning, computer vision, NLP, and deployment — all in Rust. Written for engineers who want memory-safe, high-performance ML beyond Python.

fastText Quick Start Guide

fastText Quick Start Guide

50 stars 10 forks Packt

A practical guide to Facebook's fastText — the library that made fast, scalable text classification accessible to everyone. Covers word representations, text classification, integration with Keras, TensorFlow, and PyTorch, and mobile deployment. A great entry point for applied NLP.

GitHub Projects

practical-machine-learning-w-rust

Source code and examples for Practical Machine Learning with Rust — ML algorithms, computer vision, NLP, and deployment, all in Rust.

83 27

fastText-Quick-Start-Guide

Code repository for the fastText Quick Start Guide — hands-on examples for text classification, word embeddings, and ML framework integrations.

50 10

94 repositories and counting

View all repos

Professional Experience

Sep 2025 → Present

Adobe

Lead Machine Learning Engineer

⬤ Current
  • Lead technical development of core GenAI services and APIs integrating generative models on Adobe Firefly
  • Architecting and developing enterprise-scale ML workflows for model customization, serving, and ecosystem integration with first-party and third-party generative models
  • Building and optimizing GPU-accelerated pipelines for model training and inference using PyTorch, CUDA, Triton, and TensorRT
  • Providing hands-on technical leadership — driving architecture decisions, design reviews, and technical standards for high-reliability systems
  • Leading team in tackling complex challenges related to diffusion models, transformers, and optimizing inference latency and throughput at scale
  • Fostering culture of innovation while mentoring ML engineers in distributed systems, Kubernetes, and GPU resource management
PyTorch CUDA Triton TensorRT Diffusion Models GenAI

Jul 2022 → Sep 2025

Samsung Semiconductor India R&D

Senior Machine Learning Staff Engineer

  • Performance tuning and inference optimisation for state-of-the-art LLM models for in-house NPU architecture using quantization and graph optimization
  • Research and implementation of novel RAG + LLM application for breakdown maintenance of semiconductor manufacturing equipment — built from scratch with 96% precision; applied LLM quantization on Llama 3 to reduce response time from 30s to under 10s
  • Applied research in Deep Learning for semiconductor manufacturing — improving FAB yield using AI models; conducted 100+ experiments in 2024
  • Led and mentored a team of AI researchers; defined technical roadmap and architecture for AI projects
  • Handled large datasets and developed data pipelines for model training and testing
LLM Inference Quantization RAG Deep Learning NPU

Jan 2021 → Jul 2022

Yellow.ai

Engineering Manager — NLP

  • Responsible for research, development, production, and scaling of full pipeline of message flow and intelligent NLP systems in multi-lingual and multi-modal contexts
  • Led team producing custom algorithms using PyTorch, Hugging Face, and TensorFlow
  • Prepared production-ready code with pre/post-processing and native scaling in Kubernetes and on-prem environments
  • Responsible for all MLOps and cloud infrastructure management for ML services
  • Accountable for client issues, architecture decisions, POCs, hiring, and mentorship
Multilingual NLP Kubernetes MLOps PyTorch

May 2017 → Jan 2021

Nineleaps

Team Lead — ML Platform

Team Lead — Retail Demand Forecasting · May 2019 – Jan 2021

  • Designed ML and optimization architectures for inventory forecasting across 40,000 products and thousands of stores
  • Deployed and maintained highly available forecasting application; led POCs for new forecasting methods
  • Time series analysis of KPIs; assisted business with forecast accuracy reporting
  • Built a team from scratch for successful product delivery

Team Lead — Models as a Service for Medical Analytics · May 2017 – May 2019

  • Architected and built model-serving infrastructure — auto-scaling cluster of 50+ deep-learning models handling 1M+ prediction requests per day
  • Built the model deployment lifecycle from the ground up: versioning, deployment, monitoring, and push-button deployment of ML models
  • Built ETL pipelines and querying layer for a public health knowledge graph at 1B+ edges and 200M+ nodes (Neo4j)
Demand Forecasting Apache Spark NLP Neo4j AWS Airflow

Nov 2016 → May 2017

HackerEarth

Category Head — Python

Built the Python practice section for B2C developer engagement, creating content and tooling for a high-engagement coding education platform.

Python EdTech

Aug 2015 → Oct 2016

SLK Software

Software Engineer

Software engineering at SLK — building automation tools and developing foundational software engineering instincts.

Python Automation

Dec 2011 → Jul 2015

Tata Consultancy Services

Systems Engineer

Early career in systems engineering at TCS — configuring J2EE applications, building automation tools, and developing foundational software engineering instincts.

Java / J2EE Automation Python

Tech Stack

ML / DL Frameworks

PyTorch TensorFlow Hugging Face fastText scikit-learn

Languages

Python Rust SQL Bash

Infrastructure

Kubernetes Docker AWS GCP MLflow

Specializations

LLM Inference NLP Pipelines Quantization Flash Attention ML System Design

Let's work
together.

I offer 1:1 mentorship for engineers looking to grow in ML, navigate career transitions, or sharpen their system design skills. Whether you're preparing for senior interviews or shipping your first production model, I've been there.

ML system design reviews
Career guidance for ML engineers
Mock technical interviews
Resume and portfolio review
Book on Topmate

Book a 1:1 Session

Career guidance, technical deep-dives, interview prep, and more — on your schedule.

View availability on Topmate

Get in touch

Open to collaborations, speaking opportunities, and interesting conversations.