Susan Liang

Hi there! I am a third-year Ph.D. student in the Computer Science Department at the University of Rochester. My advisor is Prof. Chenliang Xu. Before joining Prof. Xu's lab, I received my bachelor's degree in Computer Science from the University of Chinese Academy of Sciences, where I was lucky to study and conduct research under the supervision of Prof. Shiguang Shan. I joined Prof. Shan's group in 2020 and worked there for one and a half years, enjoying an exciting research experience. I also worked closely with Prof. Ming-Hsuan Yang.

My research interests lie in Computer Vision and Deep Learning, especially audio-visual learning, implicit neural fields, multi-modal learning, and trustworthy AI.

Fun fact: my Chinese name is 梁苏叁 (Liang, Su, San), so Susan is just the *pinyin* of my Chinese name. People often assume I am female when they see my English name. There is an interesting clip about the pronunciation of Susan in the film Johnny English Reborn. :D

✉️ I am actively looking for research internships for Summer 2025. Feel free to drop me a message if you are interested.

Susan Liang profile picture

Publications

BinauralFlow Teaser
ICML 2025

BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models

Susan Liang, Dejan Markovic, Israel D. Gebru, Steven Krenn, Todd Keebler, Jacob Sandakly, Frank Yu, Samuel Hassel, Chenliang Xu, Alexander Richard.

Forty-second International Conference on Machine Learning, Jul. 2025.

VIDCOMPOSITION Teaser
CVPR 2025

VIDCOMPOSITION: Can MLLMs Analyze Compositions in Compiled Videos?

Yunlong Tang, Junjia Guo, Hang Hua, Susan Liang, Mingqian Feng, Xinyang Li, Rui Mao, Chao Huang, Jing Bi, Zeliang Zhang, Pooyan Fazli, Chenliang Xu.

The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025, Jun. 2025.

AV Attack Teaser
ICLR 2025

Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives

Zeliang Zhang*, Susan Liang*, Daiki Shimada, Chenliang Xu. (* indicates equal contribution)

The Thirteenth International Conference on Learning Representations, Apr. 2025.

AI Animation Survey Teaser

Generative AI for Cel-Animation: A Survey

Yunlong Tang, Junjia Guo, Pinxin Liu, Zhiyuan Wang, Hang Hua, Jia-Xing Zhong, Yunzhong Xiao, Chao Huang, Luchuan Song, Susan Liang, Yizhi Song, Liu He, Jing Bi, Mingqian Feng, Xinyang Li, Zeliang Zhang, Chenliang Xu.

arXiv preprint.

Train Bias Teaser

Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models?

Zeliang Zhang, Xin Liang, Mingqian Feng, Susan Liang, Chenliang Xu.

arXiv preprint.

Scaling Concept Teaser

Scaling Concept with Text-Guided Diffusion Models

Chao Huang, Susan Liang, Yunlong Tang, Yapeng Tian, Anurag Kumar, Chenliang Xu.

arXiv preprint.

DAVIS Teaser
ACCV 2024 🏆 Best Paper Honorable Mention

High-Quality Visually-Guided Sound Separation from Diverse Categories

Chao Huang, Susan Liang, Yapeng Tian, Anurag Kumar, Chenliang Xu.

17th Asian Conference on Computer Vision, Dec. 2024.

AVEdit Teaser
ACCV 2024

Language-Guided Joint Audio-Visual Editing Via One-Shot Adaptation

Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu.

17th Asian Conference on Computer Vision, Dec. 2024.

L2T Teaser
CVPR 2024

Learning to Transform Dynamically for Better Adversarial Transferability

Rongyi Zhu*, Zeliang Zhang*, Susan Liang, Zhuo Liu, Chenliang Xu. (* indicates equal contribution)

Conference on Computer Vision and Pattern Recognition, Jun. 2024.

Text Attack Teaser
EACL 2024

Random Smooth-based Certified Defense against Text Adversarial Attack

Zeliang Zhang, Wei Yao, Susan Liang, Chenliang Xu.

Conference of the European Chapter of the Association for Computational Linguistics, Mar. 2024.

Video LLM Survey Teaser
TCSVT 🔥🔥🔥 HOT

Video Understanding with Large Language Models: A Survey

Yunlong Tang*, Jing Bi*, Siting Xu*, Luchuan Song*, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, Jianguo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu. (* indicates equal contribution)

IEEE Transactions on Circuits and Systems for Video Technology.

AV-NeRF Teaser
NeurIPS 2023

AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis

Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu.

Conference on Neural Information Processing Systems, Dec. 2023.

NACF Teaser
ICCV Workshop 2023

Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields

Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu.

International Conference on Computer Vision Workshops, Oct. 2023.

UniCon Teaser
ACM MM 2021 Oral

UniCon: Unified Context Network for Robust Active Speaker Detection

Yuanhang Zhang*, Susan Liang*, Shuang Yang, Xiao Liu, Zhongqin Wu, Shiguang Shan, Xilin Chen. (* indicates equal contribution)

ACM International Conference on Multimedia, Oct. 2021.

Education

University of Rochester Logo

University of Rochester, NY, USA

Ph.D. Computer Science

Sept. 2022 – Present

University of Chinese Academy of Sciences Logo

University of Chinese Academy of Sciences, Beijing, China

B.Eng. Computer Science

Sept. 2018 – Jul. 2022

Research Experiences

Meta Logo

Reality Labs Research, Meta, PA, USA

Research Scientist Intern

May 2024 – Aug. 2024

Advisors: Dr. Dejan Markovic, Dr. Israel D. Gebru, and Dr. Alexander Richard

UC Merced Logo

Vision and Learning Lab, University of California - Merced, CA, USA

Research Intern

Sept. 2021 – Mar. 2022

Advisors: Prof. Ming-Hsuan Yang and Dr. Taihong Xiao

Tsinghua University Logo

Institute for AI Industry Research, Tsinghua University, Beijing, China

Research Intern

Jun. 2021 – Aug. 2021

Advisors: Dr. Yizhi Wang and Dr. Hao Xu

UCAS Logo

Visual Information Processing and Learning Group, Chinese Academy of Sciences, Beijing, China

Research Assistant

Feb. 2020 – Apr. 2021

Advisors: Prof. Shiguang Shan and Dr. Shuang Yang