I'm keen on collaborating and sharing ideas to advance our understanding and application of AI 😄. If you're interested in working together or just want to chat about AI, feel free to get in touch
I am deeply committed to developing interpretable, editable, and robust machine learning approaches, though my interests extend beyond these core areas.
Representative papers are highlighted.
We proposed a part-based bird classifier that makes predictions based on part-wise descriptions. Our method directly utilizes human-provided descriptions (in this work, from GPT4). It outperforms CLIP and M&V by 10 points in CUB and 28 points in NABirds.
Knowledge-based Systems (KBS), Jul 4, 2023
Video /
Code /
Paper
This study proposes a coarse-to-fine fusion module between vision and language. This will help an agent learn a joint representation while navigating in a virtual environment.
(Challenge 1st)Expert System with Applications (ESWA), 1, November, 2022
Link Challenge /
Paper
Introducing Fruit-CoV, a two-stage vision framework, which is capable of detecting SARS-CoV-2 infections through recorded cough sounds. In this challenge, we won 100mil VND (~ $4275) for the 1st place.
International Journal of Advanced Computer Science and Applications (SAI), 14, April, 2020
Video 1 /
Video 2 /
Video 3 /
Paper
Applying Yolo3 to Particle Filter to enhance its speed and accuracy,
furthermore, an end-to-end localization framework using a stereo camera and IMU in the unknown environment.
(VNU Journal of Science (JCSCE))vieCap4H - VLSP 2021: Vietnamese Image Captioning for
Healthcare Domain using Swin Transformer and Attention-based LSTM, 5, May, 2022  
Paper /
Link Challenge /
Project Page /
Code
[Most interesting discussed idea award] Achieved 3rd on the private leaderboard, applying Swin Transformer as the Encoder (and other types), and LSTM Attention as the Decoder.
Choosen to be in the Special Issue of VNU Journal of Science (JCSCE)