I am a PhD student at Auburn University, Alabama, US where I am working on XAI in multimodalities. I am also passionate about Embodied AI where an AI can have the ability to understand not only language but also other different modalities such as vision, sound.
At Auburn University, I've been working under the supervision of Professor Anh Nguyen.
We proposed a part-based bird classifier that makes predictions based on part-wise descriptions. Our method directly utilizes human-provided descriptions (in this work, from GPT4). It outperforms CLIP and M&V by 10 points in CUB and 28 points in NABirds.
We are the first to explore integrating habitat information, one of the four major cues for identifying birds by ornithologists, into modern bird classifiers.
Knowledge-based Systems (KBS), Jul 4, 2023
Video /
Code /
Paper
This study proposes a coarse-to-fine fusion module between vision and language. This will help an agent learn a joint representation while navigating in a virtual environment.
(Challenge 1st)Expert System with Applications (ESWA), 1, November, 2022
Link Challenge /
Paper
Introducing Fruit-CoV, a two-stage vision framework, which is capable of detecting SARS-CoV-2 infections through recorded cough sounds. In this challenge, we won 100mil VND (~ $4275) for the 1st place.
International Journal of Advanced Computer Science and Applications (SAI), 14, April, 2020
Video 1 /
Video 2 /
Video 3 /
Paper
Applying Yolo3 to Particle Filter to enhance its speed and accuracy,
furthermore, an end-to-end localization framework using a stereo camera and IMU in the unknown environment.
(VNU Journal of Science (JCSCE))vieCap4H - VLSP 2021: Vietnamese Image Captioning for
Healthcare Domain using Swin Transformer and Attention-based LSTM, 5, May, 2022  
Paper /
Link Challenge /
Project Page /
Code
[Most interesting discussed idea award] Achieved 3rd on the private leaderboard, applying Swin Transformer as the Encoder (and other types), and LSTM Attention as the Decoder.
Choosen to be in the Special Issue of VNU Journal of Science (JCSCE)