Chia Yu Hung
I earned my Bachelor’s degree in Computer Science from Singapore University of Technology and Design (SUTD) in May 2024. I am currently a PhD student at Nanyang Technological University (NTU), where I am fortunate to be advised by Prof. Soujanya Poria. My research interests lie in multimodal learning and Vision-Language-Action models. During my undergraduate studies, I was fortunate to have the opportunity to conduct research under Prof. Roy Ka-Wei Lee and Prof. Soujanya Poria.
Current work: I am working on reinforcement learning with action-conditioned world models! Feel free to email me to chat about this!
news
| May 01, 2025 | We are excited to release Nora, a VLA based on Qwen 2.5 VL with the FAST+ tokenizer! |
|---|---|
| Jan 24, 2025 | Darwin has been accepted to NAACL 2025 (Oral)! |
| Dec 31, 2024 | We are releasing TangoFlux, a state-of-the-art text-to-audio model! |
| Jul 16, 2024 | Tango2 has been accepted to ACM MM 2024 (Oral)! |
| Jun 22, 2024 | Our new paper, Darwin, is available! |
selected publications
- Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization (2024)
- Inference Time Alignment with Reward-Guided Tree Search (2024)
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization (2024)
- NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks (2025)