shkim816 / acnn_speaker_recogView external linksLinks
acnn for text-independent speaker recognition
☆10Feb 8, 2022Updated 4 years ago
Alternatives and similar repositories for acnn_speaker_recog
Users that are interested in acnn_speaker_recog are comparing it to the libraries listed below
Sorting:
- TDY-CNN for text-independent speaker verification☆19Nov 7, 2022Updated 3 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Apr 7, 2021Updated 4 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 5 years ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆17Apr 27, 2023Updated 2 years ago
- PyTorch based speaker embedding model☆16Apr 13, 2024Updated last year
- ☆18Mar 4, 2023Updated 2 years ago
- Open source cross-platform implementation of MRCP protocol☆20Mar 3, 2022Updated 3 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Dec 10, 2020Updated 5 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- ☆15May 8, 2021Updated 4 years ago
- Calculates the Word Error Rate between two text files☆20Nov 10, 2022Updated 3 years ago
- Framework for one-shot multispeaker system based on Deep Learning☆19May 30, 2021Updated 4 years ago
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Sep 30, 2024Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37May 25, 2021Updated 4 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- ☆17Aug 27, 2025Updated 5 months ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Aug 13, 2024Updated last year
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- freeswitch百度语音识别模块☆25Feb 16, 2021Updated 5 years ago
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆54Sep 14, 2022Updated 3 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Jul 16, 2024Updated last year
- ☆67Sep 13, 2024Updated last year
- Collection of self-supervised models for speaker and language recognition tasks.☆19Jan 18, 2022Updated 4 years ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch.☆25Oct 15, 2020Updated 5 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆112Aug 18, 2021Updated 4 years ago
- Python toolkit for speech processing☆72Jan 16, 2026Updated last month