AI / ML Engineer

I build systems that think.
Then ship them.

Engineering LLM systems, inference pipelines, and production ML infrastructure. From model optimization to scalable deployment.

ML / DL
PyTorch TensorFlow Python
Infra
AWS Docker Linux
Web
React Next.js
About

Engineering mindset, not buzzwords.

I work at the intersection of deep learning research and production systems. My focus is on making ML systems that actually work at scale — not just in notebooks, but in real infrastructure serving real traffic.

Core areas: LLM inference optimization, model quantization, transformer architectures, distributed training, and end-to-end ML pipeline design. I care about latency, throughput, and reliability as much as model accuracy.

When I write about AI, I write code. Every article on this site comes with implementation details, not just theory.

Contact

Let's talk.

Building something interesting in AI? Have a question about one of my articles? Reach out.