Skip to main content

Automatic Speaker Verification

Our AI team focuses on cutting-edge artificial intelligence research and applications.

Introduction

Automatic Speaker Verification (ASV) is a subfield of speech processing and biometrics that aims to verify a person's claimed identity from speech signals. With the rapid development of deep learning, ASV has made significant progress in creating applications that can bring many benefits to life. Some applications of ASV include biometric authentication for e-banking, secure access control, forensic speaker identification, and personalized voice assistant services.

Our research group focuses on exploiting machine learning and deep learning techniques, incorporating with acoustic features and speaker-specific embeddings to develop high-performance ASV systems. We also investigate methods to construct large-scale and multi-genre speech datasets specifically for the Vietnamese language, such as the VoxVietnam, Vietnam Celeb and VSASV corpuses. Furthermore, we create robust models for spoof detection—including countermeasures against replay, text-to-speech, and voice conversion attacks—to ensure system security in cross-corpus and real-world scenarios.

Contact: Phuong Tuan Dat | ✉️ phuongtuandat2915@gmail.com

Research Direction

  • Vietnamese Speaker Recognition Datasets: Focusing on the development of high-quality, large-scale corpora tailored for the Vietnamese language. This involves creating novel construction pipelines that resolve the problem of high-proportion label noises for large-scale data retrieval, specifically for Vietnamese speakers.
  • Deep Speaker Recognition Models: Investigating state-of-the-art architectures and Self-Supervised Learning (SSL) frameworks to extract robust speaker embeddings.
  • Speech Spoof Detection Models: Developing advanced countermeasures to distinguish between "bonafide" human speech and malicious "spoof" attacks, including AI-generated deepfakes, text-to-speech, and replay

Members

Phuong Tuan Dat

Phuong Tuan Dat

Team Leader

Pham Viet Hoang

Pham Viet Hoang

Researcher

Ho Bao Thu

Ho Bao Thu

Researcher

Nguyen Tran Trung

Nguyen Tran Trung

Researcher

Latest Publications

  1. P. T. Dat, V. H. L, N. T. T. Trang. The Vietnamese Spoofing-aware Speaker Verification Challenge 2025: Summary and Results. VLSP 2025. 71–77. Hanoi, Vietnam. 01/2025
  2. P. V. Hoang, H. B. Thu, H. V. Khanh. SV++'s Vietnamese Spoofing-Aware Speaker Verification Systems for VLSP 2025. VLSP 2025. 82–88. Hanoi, Vietnam. 01/2025
  3. N. T. Trung, T. D. An, C. H. Viet. SVBK System Description to the VLSP 2025 Challenge on Vietnamese Spoofing-Aware Speaker Verification. VLSP 2025. 78–81. Hanoi, Vietnam. 01/2025
  4. H. L. Vu, P. T. Dat, P. T. Nhi, N. S. Hao, N. T. T. Trang. VoxVietnam: a Large-Scale Multi-Genre Dataset for Vietnamese Speaker Recognition. ICASSP 2025. Hyderabad, India. 04/2025
  5. V. Hoang, V. T. Pham, H. N. Xuan, P. Nhi, P. Dat, T. T. T. Nguyen. VSASV: a Vietnamese Dataset for Spoofing-Aware Speaker Verification. Interspeech 2024. Kos, Greece. 09/2024
  6. V. T. Pham, X. T. H. Nguyen, V. Hoang, T. T. T. Nguyen. Vietnamceleb: a Large-Scale Dataset for Vietnamese Speaker Recognition. Interspeech 2023. 1918–1922. Dublin, Ireland. 08/2023