Skip to content
View nhatkhangcs's full-sized avatar
🎯
Gimme a break
🎯
Gimme a break

Highlights

  • Pro

Block or report nhatkhangcs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nhatkhangcs/README.md

Khang Vo

AI Research Engineer · NLP PhD Student · Multimodal Systems Builder

Building practical AI systems across language, vision, retrieval, and agentic workflows.

Website GitHub LinkedIn Email


Profile

I am Vo Hoang Nhat Khang (Chris), a PhD student in Natural Language Processing at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI). I work on vision-language models, multimodal learning, uncertainty estimation, and Vietnamese NLP. My style is simple: take research ideas seriously, implement them cleanly, and turn prototypes into systems people can actually use.


Builder Stack

Research Vision-Language Models · Multimodal Learning · Uncertainty Estimation · Vietnamese NLP
Models LLMs · VLMs · CLIP-style models · Transformers · Retrieval-Augmented Generation
Engineering Python · C++ · Java · Backend APIs · Data pipelines · Research prototyping
Systems RAG applications · Agentic AI · Healthcare AI · Vietnamese language tools

Toolbox


Selected Builds

Project Type Description
HuTieuBERT Research Morpheme-aware Transformer work for Vietnamese NLP, accepted to ACL 2026 Main Conference.
V7 Product Vietnamese AI input method designed for faster and more flexible typing.
Namizer NLP Tool Vietnamese tokenizer that decomposes words into five linguistic components.
PhoCLIP Multimodal CLIP-style Vietnamese vision-language model for text-image understanding.
TI-JEPA Research Text-image joint-embedding predictive architecture for multimodal fusion.
LumbarCLIP Healthcare AI Multimodal CLIP-based framework for low back pain diagnosis.
VitalFit Persona Agentic AI AI-powered healthcare application with agentic workflows.
HCMUT Chatbot RAG System Retrieval-augmented chatbot for university admission support.
Synthetic Generator Data System Synthetic dataset generation system for ML workflows.

GitHub

GitHub stats Top languages

GitHub streak

GitHub contribution graph


Contact

Pinned Loading

  1. Lumbar-MedCLIP Lumbar-MedCLIP Public

    Forked from lethanhbinhxq/Lumbar-MedCLIP

    Use CLIP-based model to fine-tuning on Low Back Pain (LBP) diagnosis

    Jupyter Notebook 1

  2. tijepa tijepa Public

    Forked from ducngg/tijepa

    Official codebase for TI-JEPA, the Text-Image Joint-Embedding Predictive Architecture. First outlined in our Capstone Project Defense, got 9.9/10

    Python 3

  3. vitalfit-persona vitalfit-persona Public

    A multi-agent system specialized for healthcare-related domain

    Python 8 1

  4. v7 v7 Public

    Forked from ducngg/v7

    Super fast input method for the Vietnamese language, simplify typing through the development of a Vietnamese fast keyboard.eg. 'x0ch2' -- predict -- suggest -> 'xin chào' (meaning 'hello' in Vietna…

    Python

  5. synthetic_generator synthetic_generator Public

    Synthetic Data Generator for Machine Learning Pipelines

    Python 33 4

  6. Machine-Learning-HCMUT-Labs Machine-Learning-HCMUT-Labs Public

    This repository contains the labs of Machine Learning course at HCMUT. The lab is designed to help students understand the concepts following the slides of Dr. Nguyen Duc Dung.

    Jupyter Notebook 10 2