Dung Vo Pham Tuan

AI Researcher | AI Engineer

I graduated from Ho Chi Minh University of Technology (HCMUT) with a bachelor's degree and am currently pursuing a master's degree. With a strong background in mathematics and a passion for AI, I aspire to become an AI expert specializing in Multimodal Models.

Dung Vo Pham Tuan

About Me

I graduated from Ho Chi Minh University of Technology (HCMUT) with a bachelor's degree and am currently pursuing a master's degree. With a strong background in mathematics and a passion for AI, I aspire to become an AI expert specializing in Multimodal Models.

My research interests include Computer Vision, Natural Language Processing, and Multimodal AI systems. I currently work as an AI Model Research and Development Engineer where I focus on multilingual vision-language models and text-based person re-identification.

Skills Highlight

Core Knowledge

  • Deep Learning
  • Computer Vision
  • Natural Language Processing
  • Probability & Statistics

ML/DL Frameworks

  • TensorFlow
  • PyTorch
  • scikit-learn
  • XGBoost

Featured Projects

Explore my recent work in AI, machine learning, and software development. These projects showcase my technical skills and problem-solving approach.

Knowledge Distillation for Coding Multi-Choice Coding Question Answering

Knowledge Distillation for Coding Multi-Choice Coding Question Answering

Individual Project | Developed knowledge Distillation Framework for Structured LLM Reasoning

Knowledge Distillation for Coding Multi-Choice Coding Question Answering

Implemented an open-source knowledge distillation framework (GitHub repo) to transfer structured reasoning from GPT-4o to a mini-LLM (Qwen2.5 Coder 1.5B Instruct) for Coding Multi-Choice Coding Question Answering.

GPTLLMRAGHugging Face
Semantic Search with Large Language Model and Vector Database

Semantic Search with Large Language Model and Vector Database

Individual Project | Developed production-ready RAG system for document search and question answering

Semantic Search with Large Language Model and Vector Database

Designed and implemented a full-stack RAG system using FastAPI, Weaviate, and OpenAI, with a self-hosted vector database for data control and privacy.

FastAPIWeaviateVector DatabaseRAG
Recognizing Human Activities from Images

Recognizing Human Activities from Images

Individual Project | Developed a modern approach for Human Action Recognition

Recognizing Human Activities from Images

Worked with the Human Action Recognition benchmark from a Kaggle contest.

CLIPPyTorchTransformerMLflow
Building AI Agents for Puzzle Games

Building AI Agents for Puzzle Games

Team Project | Lead of Team-4 | Developing AI Agents

Building AI Agents for Puzzle Games

Led a team of four, responsible for coordinating overall project development and managing the project timeline.

Reinforcement LearningPyTorchPyGame

Professional Experience

AI Research and Development Engineer

Dien Toan Group

Jul 2024 -- Apr 2025

Tan Binh District, HCM City

  • Pretrained a multilingual vision-language backbone (Vietnamese/English/Chinese) for Text-based Person Re-identification on a large-scale dataset (36 million image-text pairs) using 4 NVIDIA A100 GPUs.
  • Fine-tuned a model for Text-based Person Re-identification and collaborated with the Deployment Team for serving it.
  • Kept up with advancements in Large Language Models (LLMs), Multimodal Large Language Models (MLLMs) and inference frameworks to accelerate data augmentation and increase the diversity of the pretraining dataset.
  • Extended the original English-only pretraining dataset by adding Chinese and Vietnamese captioning annotations. Conducted experiments to demonstrate that multilingual pretraining improves zero-shot retrieval performance of the backbone in each language by over 1.2% Rank-1.
  • Augmented the pretraining dataset with large-scale open-vocabulary detection/segmentation annotations and attribute tagging annotations in 1-2 days through the utilization of SOTA models, then enhanced pretraining by integrating with new additional tasks, improving zero-shot retrieval performance above 1.0%.

Click to view more

AI Researcher

Dien Toan Group

Oct 2023 -- Jun 2024

Tan Binh District, HCM City

  • Proposed the company change from fixed-attribute category person re-identification to Vietnamese Text-based Person Re-identification that is practical for Vietnamese.
  • Making this feature the company's flagship AI product compared to other opponents, attracting attention from government agencies.
  • Constructed the first Vietnamese pretraining dataset and Vietnamese benchmark dataset for this task, improving fine-tuning efficiency and model generalization.
  • Developed a Vietnamese Vision-Language backbone based on ALBEF, leveraging SOTA Vietnamese backbone PhoBERT/Videberta for text encoder and HAP/SOLIDER (human-centric image encoder) for vision encoder.

Click to view more

AI Research Intern

Dien Toan Group

Jun 2023 -- Jul 2023

Tan Binh District, HCM City

  • Conducted research on Transformer-based architectures for Object Detection and Multiple object Tracking.
  • Applied Trackformers (Facebook AI Research, CVPR 2023) for tracking pedestrians and vehicles at the campus of Ho Chi Minh University of Technology.
  • Preprocessed realistic surveillance video data and evaluated multiple efficient data annotation tools.
  • Optimized Trackformers by modifying loss functions and architecture to extend from single-class (human-only) to multi-class tracking and mitigate class imbalance in realistic training dataset.

Click to view more

Research Assistant

Data Science Lab, CSE Faculty, HCMUT

Aug 2023 -- Mar 2025

District 10, HCM City

  • Conducted academic research on Text-based Person Re-identification under the supervision of the Lab Head, who is also the founder of Tris Company. Currently co-authoring a research paper for submission to a high-impact journal.
  • Developed a state-of-the-art model, achieving a 2.8% Rank-1 accuracy improvement on benchmark datasets over recent SOTA models, making it the highlighted AI product of the lab and a benchmark for future research.

Education

Master's Degree in Computer Science

Ho Chi Minh University of Technology

Jan 2024 -- Present

  • Specializing in Applied Data Science
  • Current GPA: 8.48/10 (24/60 credits)

Click to view details

Bachelor's Degree in Computer Science

Ho Chi Minh University of Technology

Aug 2020 -- Nov 2024

  • Honors Degree with dual specializations in Image Processing & Computer Vision and Applied Artificial Intelligence
  • GPA: 8.69/10 (3.8/4) - Thesis Score: 9.7/10 (AI Research)

Click to view details

High School Diploma

Quang Trung High School for the Gifted, Binh Phuoc

Aug 2017 -- Jul 2020

  • Specialized in Mathematics with a GPA of 9.4/10
  • Direct Admission to University due to Third Prize, Vietnam Mathematical Olympiad (VMO) 2020

Click to view details

Certifications

PagerDuty logo

DevOps Professional Certificate & LinkedIn

by PagerDuty

Mar 2025

Duke University logo

Building Cloud Computing Solutions at Scale Specialization & Coursera

by Duke University

Aug 2024

NVIDIA logo

Building Real-Time Video AI Applications

by NVIDIA

Aug 2024

Weaviate logo

Vector Databases Professional Certificate

by Weaviate

Jul 2024

Docker logo

Foundations Professional Certificate , Inc

by Docker

Jun 2024

Duke University logo

Large Language Model Operations (LLMLOps) Specialization & Coursera

by Duke University

Jun 2024

Duke University logo

Machine Learning Operations (MLOps) Specialization & Coursera

by Duke University

Jun 2024

Microsoft logo

Azure Data Scientist Associate (DP-100) Exam Prep Professional Certificate

by Microsoft

Jun 2024

IBM logo

Generative AI for Data Scientists Specialization

by IBM

May 2024

IBM logo

Machine Learning Professional Certificate

by IBM

May 2024

VietAI logo

Advances In Natural Language Processing Specialization & New Turing Institute

by VietAI

Mar 2024

DeepLearning.AI logo

Machine Learning Engineering for Production (MLOps) Specialization

by DeepLearning.AI

Feb 2024

NVIDIA logo

Generative AI with Diffusion Models

by NVIDIA

Aug 2024

Databricks logo

Large Language Models Professional Certificate

by Databricks

Oct 2023

DeepLearning.AI logo

Generative Adversarial Networks (GANs) Specialization

by DeepLearning.AI

Jul 2023

IBM logo

AI Engineering Professional Certificate

by IBM

Jul 2023

DeepLearning.AI logo

Machine Learning Specialization by Stanford University |

by DeepLearning.AI

Jul 2023

DeepLearning.AI logo

Natural Language Processing Specialization

by DeepLearning.AI

Jul 2023

DeepLearning.AI logo

Deep Learning Specialization

by DeepLearning.AI

Jun 2023

DeepLearning.AI logo

TensorFlow Developer Professional Certificate

by DeepLearning.AI

Jun 2023

Honors & Awards

  • 🏆Student of Five Merits at Vietnam National University level

    Nov 2024
  • 🏆Student of Five Merits at Ho Chi Minh City level

    Nov 2024
  • 🏆Third Prize, Faculty Thesis Poster Competition For Talent Students (Top 3 Thesis)

    May 2024
  • 🏆HCMUT Incentive Scholarship for Outstanding Students

    Sep 2023
  • 🏆Odon Vallet Scholarship For Outstanding Vietnamese Students

    Sep 2020
  • 🏆Third Prize, Vietnam Mathematical Olympiad (VMO)

    Jan 2020
  • 🏆Consolation Prize, Vietnam Mathematical Olympiad (VMO)

    Jan 2019
  • 🏆Gold Medal with Top 5, Traditional Mathematical Olympiad 30/04, Middle & Southern Vietnam

    Mar 2019
  • 🏆Gold Medal with Top 1, Traditional Mathematical Olympiad 30/04, Middle & Southern Vietnam

    Mar 2018

Get In Touch

I'm always open to discussing new projects, research opportunities, or collaboration possibilities. Feel free to reach out through any of the following channels:

📧

Email

vophamtuandung05hv@gmail.com

📱

Phone

(+84) 914-684-539

Send a Message