sooftware / speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
☆64Updated 3 years ago
Alternatives and similar repositories for speech-transformer:
Users that are interested in speech-transformer are comparing it to the libraries listed below
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Updated 3 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆45Updated 3 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 4 years ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron.☆58Updated 2 years ago
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Updated 3 years ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Updated 2 years ago
- Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.☆35Updated 3 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated 4 months ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆75Updated 4 years ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 4 years ago
- A pakage for crawling audio from Youtube☆41Updated last year
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 2 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆109Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- Augmentation adversarial training for self-supervised speaker recognition☆77Updated 3 years ago
- RNN-Transducer for korean☆41Updated 4 years ago
- Making Espnet easier to use☆54Updated 4 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆46Updated 3 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Alignment files of LibriTTS.☆61Updated 5 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- Implementaion RNN tranceducer☆22Updated 5 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆47Updated 2 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆22Updated last month
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago