Experience

Axon — Software Engineer

Feb 2025 – Present · Toronto, Canada

Platform engineering on a multi-region Kubernetes cluster serving 10k+ RPS. Reduced MTTR by 17% via multi-zone deployment (Kargo) and cut deployment failures by 14% with CI validation for rate-limiting services. Migrated ALPR inference from TorchServe to Triton, reducing end-to-end latency by 37% and GPU utilization by 25% at 1k RPS of global traffic.

Shakudo — Machine Learning Engineer

Jan 2024 – Jan 2025 · Toronto, Canada

Built on-demand compute orchestration on Kubernetes for distributed training and scheduled inference across enterprise teams. Designed and shipped production RAG pipelines and NLP-to-SQL systems integrating commercial and open-source LLMs; fine-tuned open-source models on task-specific datasets using LoRA.

Noze — Machine Learning Engineer

Dec 2022 – Nov 2023 · Montreal, Canada

End-to-end time-series ML on aroma sensor data: data exploration, training, and deployment with Triton and TensorFlow Serving. Introduced MLflow and Ray for experiment tracking and hyperparameter tuning, reducing model development time by 18%. Built a FastAPI internal platform for preprocessing, visualization, and experiment management.

Zalo — Machine Learning Engineer

Jan 2021 – Oct 2022 · Ho Chi Minh City, Vietnam

Owned the OCR service and input validation pipeline supporting identity verification for 70M users, improving accuracy by 23% and tripling throughput. Designed and deployed VoiceID and TTS models for eKYC, achieving a real-time factor (RTF) of 0.01 and a mean opinion score (MOS) of 4.3/5.

AWL Vietnam — Machine Learning Engineer

Mar 2019 – Jan 2021 · Hanoi, Vietnam

Developed a real-time self-checkout system on edge devices using Person Re-ID, optimizing for low-latency inference. Led R&D on pose estimation, attention, and action recognition to deliver computer vision proofs of concept.

Chappiebot — Machine Learning Engineer

Oct 2017 – Feb 2019 · Ho Chi Minh City, Vietnam

Built computer vision and NLP features for a social platform; a production feature increased user engagement by 21%. Managed model training and serving on AWS (S3, ECS, SageMaker).


Education

Master of Science, Optimization and Statistics — UTT, Troyes, France (2016–2017)
French Ministry of Foreign Affairs Excellence Scholarship

Bachelor of Engineering, Electronics and Computer Engineering — HCMUT, Ho Chi Minh City, Vietnam (2012–2016)


Technical Skills

ML Systems: Triton, TensorFlow Serving, MLflow, Ray, RAG pipelines, LLM fine-tuning (LoRA), NLP-to-SQL, OCR, inference optimization

Infrastructure: Kubernetes, Helm, Docker, multi-region deployment, autoscaling, API gateways, SLO/SLI, Grafana

Backend & Data: FastAPI, Flask, NestJS, PostgreSQL, Redis, Python