georgesterpu / TarisLinks
Transformer-based online speech recognition system with TensorFlow 2
☆26Updated 4 years ago
Alternatives and similar repositories for Taris
Users that are interested in Taris are comparing it to the libraries listed below
Sorting:
- ☆52Updated 4 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆40Updated 4 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 5 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- ☆36Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 9 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- ☆27Updated 3 years ago
- Official Implementation of Mockingjay in Pytorch☆54Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆40Updated 4 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Updated 6 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Speech (audio) subjective evaluation system☆39Updated 5 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Updated 2 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆20Updated 3 years ago
- ☆36Updated 4 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 2 years ago
- ☆16Updated 6 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆53Updated 5 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 6 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- Implementation of Multi speaker TTS☆51Updated 4 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Updated 3 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆74Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 4 years ago
- ☆40Updated 3 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Updated 2 years ago