nvidia-riva / nemo2rivaLinks
NeMo -> Riva Conversion Tool
☆16Updated last week
Alternatives and similar repositories for nemo2riva
Users that are interested in nemo2riva are comparing it to the libraries listed below
Sorting:
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Updated 5 months ago
- A toolkit for processing speech data and creating speech datasets☆133Updated this week
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆20Updated last year
- Online streaming speaker change detection model in Pytorch☆42Updated 2 years ago
- ☆54Updated last year
- Various speech datasets made available to the public☆126Updated 7 months ago
- Example code for a neural transducer model.☆65Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆142Updated 2 months ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆156Updated 3 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆143Updated last year
- Memory efficient transducer loss computation☆68Updated 3 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆67Updated 2 months ago
- ☆61Updated last year
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆72Updated 4 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆72Updated last month
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆95Updated 7 months ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- An effort to track benchmarking results over widely-used datasets for ASR.☆47Updated 3 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆108Updated 3 years ago
- Repository for speech paper reading☆33Updated 3 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆23Updated 5 months ago
- A list of papers for child ASR☆46Updated 10 months ago
- Onnx wrapper for espnet infrernce model☆168Updated 10 months ago
- ☆68Updated 3 years ago
- MeetEval - A meeting transcription evaluation toolkit☆105Updated 3 weeks ago
- ConMamba for Automatic Speech Recognition☆80Updated last year
- Discriminative Training of VBx Diarization☆25Updated 10 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆78Updated last month
- Multistream CNN for Robust Acoustic Modeling☆40Updated 4 years ago