sooftware / speech-transformerLinks

Transformer implementation speciaized in speech recognition tasks using Pytorch.

☆64

Alternatives and similar repositories for speech-transformer

Users that are interested in speech-transformer are comparing it to the libraries listed below

Sorting:

sooftware / End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
☆38Updated 4 years ago
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆38Updated 3 years ago
sooftware / RNN-Transducer
PyTorch implementation of RNN-Transducer(RNN-T).
☆78Updated 4 years ago
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆46Updated 4 years ago
sooftware / jasper
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
☆32Updated 4 years ago
sooftware / openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
☆35Updated 3 years ago
voithru / voice-activity-detection
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
☆156Updated 3 years ago
JoungheeKim / Non-Attentive-Tacotron
This is Pytorch Implementation of Google's Non-attentive Tacotron.
☆57Updated 2 years ago
jindongwang / EasyEspnet
Making Espnet easier to use
☆56Updated 4 years ago
zerospeech / zerospeech2021_baseline
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60Updated 2 years ago
zldzmfoq12 / VCtube
A pakage for crawling audio from Youtube
☆42Updated 2 years ago
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33Updated 3 years ago
idiap / contextual-biasing-on-gpus
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆20Updated last year
andi611 / Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
☆55Updated 2 years ago
KrishnaDN / Keyword-Transformer
Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆23Updated 4 years ago
ldong1111 / GraphemeBERT
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models
☆46Updated 3 years ago
upskyy / Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
☆143Updated 2 years ago
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
☆45Updated 2 years ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆108Updated 3 years ago
hash2430 / pitchtron
TTS for pitch-accented language. Korean dialect DB.
☆157Updated 2 years ago
skgusrb12 / voice_activity_detection
Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆26Updated 4 years ago
pashanitw / W2V2-BERT-ASR-Training
☆16Updated last year
KrishnaDN / BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Updated 4 years ago
CARNIVAL-IITP / Speaker-Identification
발화자 지정 모듈
☆21Updated 5 months ago
Observeai-Research / Phoneme-BERT
☆34Updated 4 years ago
LEEYOONHYUNG / BVAE-TTS
Official implementation of BVAE-TTS
☆173Updated 2 years ago
Sreyan88 / LipGER
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆17Updated last year
CODEJIN / Speaker_Embedding_Torch
PyTorch based speaker embedding model
☆16Updated last year
SMART-TTS / SMART-Single_Emotional_TTS
☆97Updated 2 years ago
ynop / py-ctc-decode
CTC Decoder implementation with python only. Also supports language model decoding using KenLM.
☆37Updated last year