sadhusamik / speech_recognition_tools
☆8Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for speech_recognition_tools
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Updated 3 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated last month
- ☆20Updated 4 years ago
- Dataset simulation for DPCCN.☆14Updated last year
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- ☆15Updated 3 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 2 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆17Updated last month
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 2 weeks ago
- End-to-end diarization loss☆22Updated 3 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆16Updated 2 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆14Updated this week
- Balanced Error Rate for Speaker Diarization☆25Updated last year
- ☆12Updated last year
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆15Updated 2 years ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆20Updated 2 months ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆15Updated 3 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated 2 years ago
- Implementation of CTC alignment-based single step non-autoregressive transformer☆12Updated last year
- ☆14Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- ☆16Updated 2 years ago
- Discriminative Training of VBx Diarization☆18Updated last month
- ☆12Updated 9 months ago
- ☆12Updated 3 years ago
- acnn for text-independent speaker recognition☆9Updated 2 years ago
- Neural network density models for speech separation.☆20Updated 3 years ago
- Efficient Speech Processing Tookit for Automatic Speaker Recognition☆17Updated last year
- ☆13Updated this week