• 内网
  • Search
  • 简体中文
  • About
    • About Us
    • Contact Us
  • Faculty
  • Admissions
    • Admission
  • Research
    • Center for AI Theoretical Foundation and Systems
    • Center for Language, Intelligence and Machines
    • Center for AI for Science and Engineering
    • Center for AI for Social Science
    • Center for Embodied Artificial Intelligence and Computer Vision
  • News
    • School News
  • Recruitment
    • Academic Positions
  • Academic Forum
    • Forum Schedule

Breadcrumb

  • Home

Wang Shuai

Associate Professor

Nanjing University

Research Field
Speech signal processing, speech synthesis and conversion, speaker modeling, target speech extraction, large speech models,Applications of audio processing in healthcare (pathological speech recognition and generation, etc.)
Email
shuaiwang@slai.edu.cn
Biography

Dr. Shuai Wang is an Associate Professor, Distinguished Researcher, and Ph.D. Supervisor at Nanjing University, with a joint appointment at Hetao College. He specializes in intelligent audio signal processing across multimodal acoustic signals (speech, audio events, music). He holds a Ph.D. from Shanghai Jiao Tong University and formerly served as Senior Researcher at Tencent Photon Studio. With 40+ publications as first/corresponding author in premier venues (ICASSP, Interspeech) and 10+ patents, Dr. Wang has won championships at VoxSRC 2019 and DIHARD 2019, plus Best Paper awards at ISCSLP 2024. His open-source project WeSpeaker achieves 10M+ monthly downloads on HuggingFace, widely adopted in academia and industry. Research collaborations and exceptional students are welcome in speech processing, multimodal AI, and large models.

Academic Publications
  1. Wang S, Chen Z, Han B, Wang H, Liang C, Zhang B, Xiang X, Ding W, Rohdin J, Silnova A, et al. Advancing Speaker Embedding Learning: Wespeaker Toolkit for Research and Production. Speech Communication, 2024.
  2. Wu W, Chen X, Wang S*, Wang J, Meng L, Wu X, Meng H, Li H. C2AV-TSE: Context and Confidence-aware Audio Visual Target Speaker Extraction. IEEE Journal of Selected Topics in Signal Processing, 2025.
  3. Ma Y, Wang S*, Liu T, Li H. PhiNet: Speaker Verification with Phonetic Interpretability. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2025.
  4. Yang C, Wang S*, Chen H, Tan W, Yu J, Li H. SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement. NeurIPS, 2025
  5. Wang W, Pan Z, Li X, Wang S, Li H. Speech Separation with Pretrained Frontend to Minimize Domain Mismatch. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024.
  6. Wang S, Yang Y, Wu Z, Qian Y, Yu K. Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020.
Contact Us
Contact Us
  • Admissions:admission@slai.edu.cn Admissions Hotline:(86)0755 81970253 (Weekdays, 9:30–11:00 am & 3:00–5:00 pm) Faculty Recruitment:FacultyHiring@slai.edu.cn Industry-Academia Collaboration:coop@slai.edu.cn
  • Staff Careers:staff_careers@slai.edu.cn Executive Office: executiveoffice@slai.edu.cn Student Affairs: student@slai.edu.cn Bidding: bidding@slai.edu.cn Dean's Office: deanoffice@slai.edu.cn
  • Finance Office: financeoffice@slai.edu.cn Tel:0755-83590055 (Weekdays, 9:30–11:00 am & 3:00–5:00 pm) No. 6 Hongmian Road, Futian Free Trade Zone
Business Hours
  • 8:30–12:00, 13:00–17:30 (Monday to Friday) Closed on Weekends & Public Holidays

Copyright © SLAI All Rights Reserved. 粤ICP备14099122号-14 

​