About Me
Hi! I’m Wenxi Chen (陈文熙). I am currently a first-year Ph.D. student at the X-Lance Lab, Shanghai Jiao Tong University (SJTU), under the supervision of Prof. Xie Chen. I received my Bachelor’s degree in Computer Science (IEEE Pilot Class) from SJTU in 2025.
I’m broadly interested in speech and audio understanding and generation, as well as multimodal large language models. My previous projects have covered audio self-supervised learning, audio captioning, and end-to-end spoken dialogue models.
Selected Publications
For the most up-to-date information, please visit my Google Scholar profile.
 (* indicates equal contribution)
SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization
 Wenxi Chen, Xinsheng Wang, Ruiqi Yan, Yushen Chen, Zhikang Niu, Ziyang Ma, Xiquan Li, Yuzhe Liang, Hanlin Wen, Shunshun Yin, Ming Tao, Xie Chen
 arXiv 2025
 paper / demo / code
SimulS2S-LLM: Unlocking Simultaneous Inference of Speech LLMs for Speech-to-Speech Translation
 Keqi Deng, Wenxi Chen, Xie Chen, Phil Woodland
 ACL 2025
 paper
SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training
 Wenxi Chen, Ziyang Ma, Ruiqi Yan, Yuzhe Liang, Xiquan Li, Ruiyang Xu, Zhikang Niu, Yanqiao Zhu, Yifan Yang, Zhanxun Liu, Kai Yu, Yuxuan Hu, Jinyu Li, Yan Lu, Shujie Liu, Xie Chen
 ACL 2025 Findings
 paper / demo / code
SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs
 Wenxi Chen*, Ziyang Ma*, Xiquan Li, Xuenan Xu, Yuzhe Liang, Zhisheng Zheng, Kai Yu, Xie Chen
 ICASSP 2025
 paper / code 
DRCap: Decoding CLAP Latents with Retrieval-augmented Generation for Zero-shot Audio Captioning
 Xiquan Li, Wenxi Chen, Ziyang Ma, Xuenan Xu, Yuzhe Liang, Zhisheng Zheng, Qiuqiang Kong, Xie Chen
 ICASSP 2025 (oral)
 paper / code
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
 Wenxi Chen, Yuzhe Liang, Ziyang Ma, Zhisheng Zheng, Xie Chen
 IJCAI 2024
 paper / code 
Activities
Experience
Research Intern @ Soul App
 Multimodal Interaction Group, Shanghai, China
 Advised by Dr. Xinsheng Wang
 2025.07-Present
Research Intern @ Microsoft Research Asia (MSRA)
 General Artificial Intelligence Group & Speech Team, Beijing, China
 Co-advised by Dr. Shujie Liu & Dr. Jinyu Li
 2024.09-2025.06
Competition
IEEE ICME 2024 Challenge Semi-supervised Acoustic Scene Classification under Domain Shift
 Ranked 2nd, Team Leader
DCASE Challenge 2024 Task 6: Automated Audio Captioning
 Ranked 3rd, Team Leader
Awards
Rongchang Science and Technology Innovation Scholarship, 2024
CV
Here is my CV (Chinese).
Contact
Email: 1029713857@sjtu.edu.cn
