Yaoting WANG
Introduction
Hallo! I'm Yaoting WANG, and I'm currently a fresh year PhD student at Fudan University. My primary interests lie in Multimodal LLM, Audio-Visual Intelligence and Segmentation.
Education
- Ph.D. @ Fudan University (2025 - 2029, expected)
- M.Sc. @ University of Edinburgh (2021 - 2022)
- B.Sc. @ University of Limerick (2019 - 2021)
- B.Eng. @ Shandong University of Science and Technology (2017 - 2021)
Honours & Awards
- 1. Honours Bachelor's Degree, University of Limerick
- 2. Joint Programme Scholarship, University of Limerick
- 3. School Global Scholarship (Top 10, £150K), University of Edinburgh
Professional
-
Program Chair @ MUCG, MM'25 (07/2025 - 10/2025)
Homepage: MUCG@MM'25. -
Research Intern @ AIR, THU (02/2025 - 09/2025)
Advised by Prof. Yunxin Liu. -
Visiting Student @ MiniGPT, KAUST (04/2024 - 02/2025)
Advised by Dr. Jian Ding and Prof. Mohamed Elhoseiny. -
Research Assistant @ GSAI, RUC (03/2023 - 03/2024)
Advised by Prof. Di Hu.
News
- [01-07-2025] We hosted the 1st International Workshop on MLLM for Unified Comprehension and Generation (MUCG@MM'25), and I served as the Program Chair. Welcome paper submissions.
- [29-06-2025] Our paper "AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs" has been accepted at ICCV 2025.
- [23-05-2025] We organize the 1st International Workshop on MLLM for Unified Comprehension and Generation (MUCG) at ACM 2025, Call for Papers is now open!
- [01-05-2025] Our paper "On Path to Multimodal Generalist" has been accepted as a Spotlight/Oral presentation at ICML 2025!
- [18-03-2025] We release the first comprehensive survey on multimodal chain-of-thought reasoning, along with the Awesome-MCoT repository.
-
[30-08-2024]
- Thanks QBitAI's report for our work Ref-AVS, welcome to follow! -
[09-07-2024]
- Glad to have a speech on GAVS at Vision and Learning SEminar (VASLE)! -
[16-07-2024]
- We are excieted to release our new task Reference Audio-Visual Segmentation (Ref-AVS) and its benchmark dataset Ref-AVS Bench. How can we ask machines to locate objects of interest in the real-world with vision, audio, language... just like a human! -
[01-07-2024]
- Three papers Ref-AVS, Segmentation-Preference and Stepping-Stones have been accepted by ECCV 2024! -
[01-03-2024]
- Joined MiniGPT Group, Vision-CAIR, KAUST! -
[09-12-2023]
- One paper GAVS has been accepted by AAAI 2024 main track and ICCV 2023 AV4D workshop! -
[05-01-2023]
- Joined GeWu Lab, Gaoling School of Artificial Intelligence, Renmin University of China!