Yaoting Wang
Home
Publications
Gallery
CV
Publications
Selected research contributions
Conference Papers
ICCV
2025
AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs
Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta,
Yaoting Wang
, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha
arXiv
ICML
Oral
On Path to Multimodal Generalist: General-level and General-bench
Hao Fei*, Yuan Zhou*, Juncheng Li*, Xiangtai Li*, Qingshan Xu*, Bobo Li*, Shengqiong Wu*,
Yaoting Wang
, Junbao Zhou, Jiahao Meng, et al.
Project
ECCV
2024
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes with Natural Language
Yaoting Wang*
, Peiwen Sun*, Dongzhan Zhou, Guangyao Li, Honggang Zhang, Di Hu
arXiv
Project
ECCV
2024
Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Yaoting Wang*
, Peiwen Sun*, Yuanchao Li, Honggang Zhang, Di Hu
arXiv
ECCV
2024
Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation
Juncheng Ma, Peiwen Sun,
Yaoting Wang
, Di Hu
arXiv
AAAI
2024
Prompting Segmentation with Sound is Generalizable Audio-visual Source Localizer
Yaoting Wang*
, Weisong Liu*, Guangyao Li, Jian Ding, Di Hu, Xi Li
arXiv
IEEE SCC
Scaling Up Mobile Service Selection in Edge Computing Environment with Cuckoo Optimization Algorithm
Ming Zhu, Feilong Yu, Xiukun Yan, Jing Li,
Yaoting Wang
Pre-prints
arXiv
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Yaoting Wang
, Shengqiong Wu, Yuechen Zhang, Shuicheng Yan, Ziwei Liu, Jiebo Luo, Hao Fei
Awesome-MCoT
-
arXiv
Cross-Attention is Not Enough: Incongruity-Aware Dynamic Hierarchical Fusion for Multimodal Affect Recognition
Yaoting Wang*
, Yuanchao Li*, Paul Pu Liang, Louis-Philippe Morency, Peter Bell, Catherine Lai