sadhusamik / speech_recognition_tools
☆8Updated 2 years ago
Related projects: ⓘ
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 2 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆9Updated 2 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated 6 months ago
- ☆12Updated 3 years ago
- ☆20Updated 3 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Updated 4 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆11Updated 5 years ago
- End-to-end diarization loss☆19Updated 3 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆10Updated 3 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 2 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆15Updated last month
- ☆12Updated last year
- ☆16Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆19Updated this week
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- Dataset simulation for DPCCN.☆14Updated last year
- Implementation of CTC alignment-based single step non-autoregressive transformer☆11Updated last year
- ☆12Updated last week
- ☆9Updated 4 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆15Updated 3 years ago
- ☆15Updated 3 years ago
- ☆25Updated 3 months ago
- A collection of papers related to speech model compression☆24Updated last year
- A temporal module for PyTorch-ComplexTensor☆45Updated 2 months ago
- Speechflow for emotion recognition related information decomposition☆9Updated 3 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 2 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- a MUSHRA compliant web audio API based experiment software☆10Updated 2 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated 11 months ago