Skip to content

This repository presents a comprehensive PyTorch implementation of an end-to-end Speaker Verification system, incorporating state-of-the-art deep learning architectures and language models. The system features robust speaker recognition capabilities, with specialized support for the Vietnamese

License

Notifications You must be signed in to change notification settings

ducnt18121997/Viet-SASV

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

End-to-end Speaker Verification (/w Spoof Aware) - PyTorch Implementation

A PyTorch-based implementation of Speaker Verification utilizing deep learning architectures and language models, with support for speaker recognition on the Vietnamese dataset by Dean Nguyen.

Table of Contents

Architecture

Augmentation

Setup

conda create --name venv python=3.8.10
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge
pip install -r requirements.txt

Running

Change settings in setting/setting.yaml and run:

python cores/train.py

References

NOTE: This project was developed quite a while ago, so I may have missed some information about the repositories used. Please create a ticket so I can add them here.

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Responsibility

This implementation is provided as-is, without any warranties or guarantees. The authors are not responsible for any misuse or damage caused by this software. Users are responsible for:

  1. Ensuring proper data privacy and security when using this software
  2. Complying with all applicable laws and regulations
  3. Obtaining necessary permissions for any data used
  4. Properly citing and acknowledging the original authors of the referenced papers
  5. Understanding and accepting the limitations of the models and algorithms

The implementation is based on academic research papers and should be used for research purposes only. Commercial use may require additional permissions and compliance with relevant regulations.

Citation

If you use this code in your research, please cite:

@misc{deanng_2025,
    author = {Dean Nguyen},
    title = {End-to-end Speaker Verification - PyTorch Implementation},
    year = {2025},
    publisher = {GitHub},
    journal = {GitHub repository},
    howpublished = {\url{https://github.com/ducnt18121997/Viet-SASV}}
}

About

This repository presents a comprehensive PyTorch implementation of an end-to-end Speaker Verification system, incorporating state-of-the-art deep learning architectures and language models. The system features robust speaker recognition capabilities, with specialized support for the Vietnamese

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Languages