Experience
Axon — Software Engineer
Feb 2025 – Present · Toronto, Canada
Platform engineering on a multi-region Kubernetes cluster serving 10k+ RPS. Reduced MTTR by 17% via multi-zone deployment (Kargo) and cut deployment failures by 14% with CI validation for rate-limiting services. Migrated ALPR inference from TorchServe to Triton, reducing end-to-end latency by 37% and GPU utilization by 25% across 1k RPS of global traffic.
Shakudo — Machine Learning Engineer
Jan 2024 – Jan 2025 · Toronto, Canada
Built on-demand compute orchestration on Kubernetes for distributed training and scheduled inference across enterprise teams. Designed and shipped production RAG pipelines and NLP-to-SQL systems integrating commercial and open-source LLMs; fine-tuned open-source models on task-specific datasets using LoRA.
Noze — Machine Learning Engineer
Dec 2022 – Nov 2023 · Montreal, Canada
End-to-end time-series ML on aroma sensor data: data exploration, training, and deployment with Triton and TensorFlow Serving. Introduced MLflow and Ray for experiment tracking and hyperparameter tuning, reducing model development time by 18%. Built an internal FastAPI platform for preprocessing, visualization, and experiment management.
Zalo — Machine Learning Engineer
Jan 2021 – Oct 2022 · Ho Chi Minh City, Vietnam
Owned the OCR service and input validation pipeline supporting identity verification for 70M users, improving accuracy by 23% and tripling throughput. Designed and deployed VoiceID and TTS models for eKYC, achieving a real-time factor (RTF) of 0.01 and a CMOS score of 4.3/5.
AWL Vietnam — Machine Learning Engineer
Mar 2019 – Jan 2021 · Hanoi, Vietnam
Developed a real-time self-checkout system on edge devices using Person Re-ID, optimizing for low-latency inference. Led R&D on pose estimation, attention, and action recognition for computer vision PoC delivery.
Chappiebot — Machine Learning Engineer
Oct 2017 – Feb 2019 · Ho Chi Minh City, Vietnam
Built computer vision and NLP features for a social platform; a production feature increased user engagement by 21%. Managed model training and serving on AWS (S3, ECS, SageMaker).
Education
Master of Science, Optimization and Statistics — UTT, Troyes, France (2016–2017) · French Ministry of Foreign Affairs Excellence Scholarship
Bachelor of Engineering, Electronics and Computer Engineering — HCMUT, Ho Chi Minh City, Vietnam (2012–2016)
Technical Skills
ML Systems: Triton, TensorFlow Serving, MLflow, Ray, RAG pipelines, LLM fine-tuning (LoRA), NLP-to-SQL, OCR, inference optimization
Infrastructure: Kubernetes, Helm, Docker, multi-region deployment, autoscaling, API gateways, SLO/SLI, Grafana
Backend & Data: FastAPI, Flask, NestJS, PostgreSQL, Redis, Python