facebookresearch / covost
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)
☆347Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for covost
- Tracking the progress in end-to-end speech translation☆252Updated last year
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆510Updated last year
- Segment an audio file and obtain utterance alignments. (Python package)☆321Updated 5 months ago
- Library for Textless Spoken Language Processing☆529Updated last year
- Large, modern dataset for speech recognition☆644Updated 8 months ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆427Updated last year
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆478Updated last year
- End-to-end ASR/LM implementation with PyTorch☆594Updated 3 years ago
- ☆175Updated 2 years ago
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset☆152Updated 5 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆225Updated 3 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆464Updated 4 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆433Updated 7 months ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆233Updated 5 years ago
- CMU Wilderness Multilingual Speech Dataset☆273Updated 5 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆153Updated 4 years ago
- ☆272Updated 3 years ago
- A Neural Machine Translation toolkit for research purpose☆82Updated last week
- A CRF-based ASR Toolkit☆325Updated 2 months ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆200Updated 3 years ago
- A fast parallel implementation of RNN Transducer.☆307Updated last year
- Multilingual G2P in 100 languages☆285Updated last year
- ☆208Updated 11 months ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆336Updated 2 years ago
- Neural end-to-end Speech Translation Toolkit☆298Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversion☆810Updated last year
- ESPnet Model Zoo☆245Updated last year
- INTERSPEECH 2019 Tutorial Materials☆193Updated 3 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf☆362Updated 3 years ago
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"☆659Updated last year