Chia Yu Hung

IMG_0860.jpg

I am currently a 1st year PhD student at Nanyang Technological University (NTU), fortunately to be advised Prof. Soujanya Poria. My research interest lies in MultiModals and Vision Language Action Models.

Previously, I earned my Bachelor’s degree in Computer Science from Singapore University of Technology and Design (SUTD) in May 2024. Throughout my undergraduate period, I was fortunate to have opportunity to conduct research under Prof. Roy Ka-Wei Lee and Prof. Soujanya Poria.

Current works: I am working on reinforcement learning with action conditioned world models!! Feel free to email me to chat about this!

news

Jan 27, 2026 TangoFlux has been accepted to ICLR 2026!
Jan 27, 2026 TangoFlux has been accepted to ICLR 2026!
Nov 20, 2025 We are releasing NORA-1.5, a VLA post-trained using action conditioned world model as reward.
May 01, 2025 We are excited to release Nora, a VLA based on Qwen 2.5 VL with FAST+ tokenizer!.
Jan 24, 2025 Darwin has been accepted to NAACL 2025 (Oral)!

selected publications

  1. Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
    Navonil Majumder*, Chia-Yu Hung*, Deepanway Ghosal*, Wei-Ning Hsu , and 2 more authors
    2024
  2. Inference Time Alignment with Reward-Guided Tree Search
    Chia-Yu Hung, Navonil Majumder, Ambuj Mehrish, and Soujanya Poria
    2024
  3. TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
    Chia-Yu Hung, Navonil Majumder, Zhifeng Kong, Ambuj Mehrish , and 3 more authors
    2024
  4. NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
    Chia-Yu Hung, Qi Sun, Pengfei Hong, Amir Zadeh , and 4 more authors
    2025