sooftware / openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
☆34Updated 3 years ago
Alternatives and similar repositories for openspeech:
Users who are interested in openspeech are comparing it to the libraries listed below.
- Transformer implementation specialized in speech recognition tasks using PyTorch.☆65Updated 3 years ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 3 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆150Updated 3 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆100Updated 2 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆43Updated 2 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆34Updated 2 years ago
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Updated 4 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆29Updated 4 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 3 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Updated 3 years ago
- Repository for speech paper reading☆32Updated 3 years ago
- A package for crawling audio from YouTube☆41Updated last year
- Implementation of RNN transducer☆21Updated 5 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆57Updated last year
- Recurrent Neural Aligner☆49Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆72Updated 4 years ago
- RNN-Transducer for Korean☆39Updated 4 years ago
- An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection☆24Updated 6 months ago
- PyTorch implementation of Google's Non-attentive Tacotron.☆57Updated 2 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. This model shows state-of-the-art performance on Speech Commands dataset V1 and V2.☆102Updated 2 years ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆74Updated 3 years ago
- A simple tool to easily use Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD.☆44Updated last year
- An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal.☆53Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- An unofficial PyTorch implementation of Google's VoiceFilter☆99Updated last year
- An open-source conversational speech synthesis platform that can express a robot's emotions and personality☆105Updated 3 years ago