sooftware / lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

☆46

Alternatives and similar repositories for lightning-asr

Users who are interested in lightning-asr are comparing it to the libraries listed below.

sooftware / speech-transformer
Transformer implementation specialized in speech recognition tasks using PyTorch.
☆64 Updated 3 years ago
sooftware / RNN-Transducer
PyTorch implementation of RNN-Transducer (RNN-T).
☆78 Updated 4 years ago
ldong1111 / GraphemeBERT
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models".
☆46 Updated 3 years ago
deepaudio / deepaudio-speaker
Neural-network-based speaker embedder.
☆25 Updated 2 years ago
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
☆45 Updated 2 years ago
sooftware / End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
☆38 Updated 4 years ago
zldzmfoq12 / VCtube
A package for crawling audio from YouTube.
☆42 Updated 2 years ago
ynop / py-ctc-decode
CTC decoder implementation in pure Python. Also supports language model decoding using KenLM.
☆37 Updated last year
Deepest-Project / Transformer-TTS
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64 Updated 2 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12 Updated 4 years ago
YoungloLee / tf2-speech-recognition-transformer
TensorFlow 2 speech recognition code (Transformer).
☆25 Updated 5 years ago
JoungheeKim / Non-Attentive-Tacotron
A PyTorch implementation of Google's Non-Attentive Tacotron.
☆57 Updated 2 years ago
MiniXC / LightningFastSpeech2
☆56 Updated 2 years ago
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆73 Updated 4 years ago
RuABraun / texterrors
☆37 Updated 3 months ago
nvidia-riva / riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
☆91 Updated 5 months ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31 Updated 3 years ago
voidful / SpeechMix
Explore different ways to combine speech models (wav2vec2, HuBERT) with NLP models (BART, T5, GPT).
☆47 Updated last month
diego-fustes / asr-rescoring
Rescoring methods for end-to-end Automatic Speech Recognition
☆27 Updated 4 years ago
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33 Updated 3 years ago
daanzu / wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆24 Updated 3 years ago
aispeech-lab / w2v-cif-bert
☆38 Updated 4 years ago
zerospeech / zerospeech2021_baseline
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60 Updated 2 years ago
neosapience / editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
☆117 Updated 2 years ago
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆38 Updated 3 years ago
Observeai-Research / Phoneme-BERT
☆34 Updated 4 years ago
KrishnaDN / BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17 Updated 4 years ago
pashanitw / W2V2-BERT-ASR-Training
☆16 Updated last year
vectominist / MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆54 Updated 2 years ago
asappresearch / sew
☆76 Updated 3 years ago