sooftware / RNN-Transducer
PyTorch implementation of RNN-Transducer(RNN-T).
☆74Updated 3 years ago
Alternatives and similar repositories for RNN-Transducer:
Users that are interested in RNN-Transducer are comparing it to the libraries listed below
- Example code for a neural transducer model.☆61Updated 11 months ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆65Updated 3 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆140Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model.☆72Updated 4 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆34Updated 2 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆45Updated 3 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆100Updated 2 years ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron.☆57Updated 2 years ago
- Memory efficient transducer loss computation☆68Updated 2 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 3 years ago
- ☆67Updated 2 years ago
- An effort to track benchmarking results over widely-used datasets for ASR.☆46Updated 2 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆101Updated last year
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆214Updated last year
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆83Updated 5 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆75Updated 3 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆43Updated 2 years ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆114Updated 2 years ago
- ☆37Updated 3 years ago
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Updated 4 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆102Updated 2 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆129Updated last month
- E2E-SincNet: Toward fully end-to-end speech recognition☆29Updated 4 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆115Updated 3 years ago
- Recurrent Neural Aligner☆49Updated 4 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆131Updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago