facebookresearch/covost

CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)

☆401

Alternatives and similar repositories for covost

Users who are interested in covost are comparing it to the repositories listed below.

kahne / SpeechTransProgress
Tracking the progress in end-to-end speech translation
☆260 · Oct 25, 2023 · Updated 2 years ago
google-research-datasets / cvss
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
☆220 · Aug 26, 2022 · Updated 3 years ago
facebookresearch / libri-light
Dataset for lightly supervised training using the LibriVox audiobook recordings (https://librivox.org/).
☆528 · Jul 11, 2023 · Updated 3 years ago
facebookresearch / voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
☆574 · Apr 2, 2023 · Updated 3 years ago
bytedance / neurst
Neural end-to-end Speech Translation Toolkit
☆306 · Jun 28, 2022 · Updated 4 years ago
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆689 · Aug 30, 2021 · Updated 4 years ago
facebookresearch / CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆374 · Oct 12, 2021 · Updated 4 years ago
dqqcasia / awesome-speech-translation
☆178 · Nov 10, 2021 · Updated 4 years ago
mattiadg / FBK-Fairseq-ST
An adaptation of Fairseq to (End-to-end) speech translation.
☆22 · Jun 1, 2022 · Updated 4 years ago
kahne / NonAutoregGenProgress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
☆300 · Mar 15, 2023 · Updated 3 years ago
facebookresearch / SimulEval
SimulEval: A General Evaluation Toolkit for Simultaneous Translation
☆126 · Sep 13, 2024 · Updated last year
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,398 · Jun 6, 2024 · Updated 2 years ago
SpeechColab / GigaSpeech
Large, modern dataset for speech recognition
☆731 · Feb 26, 2024 · Updated 2 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and fast.
☆16 · Jun 17, 2022 · Updated 4 years ago
s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,556 · Mar 12, 2026 · Updated 4 months ago
espnet / icassp2020-tts
ESPnet-TTS Audio Sample HP
☆21 · Oct 25, 2019 · Updated 6 years ago
facebookresearch / speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…
☆416 · Aug 29, 2023 · Updated 2 years ago
ttaoREtw / semi-tts
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39 · Jul 16, 2020 · Updated 6 years ago
bshall / UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238 · Nov 14, 2020 · Updated 5 years ago
isl-mt / fluent-fisher
☆15 · Jun 17, 2019 · Updated 7 years ago
Deepest-Project / AlignTTS
Implementation of AlignTTS
☆77 · Jul 6, 2023 · Updated 3 years ago
liusongxiang / efficient_tts
PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"
☆116 · Dec 22, 2021 · Updated 4 years ago
openaudiolab / LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆26 · Aug 11, 2024 · Updated last year
keonlee9420 / Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆191 · Nov 18, 2021 · Updated 4 years ago
MarkWuNLP / SemanticMask
This repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"
☆39 · Jun 9, 2020 · Updated 6 years ago
freewym / espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939 · Sep 4, 2024 · Updated last year
speechio / chinese_text_normalization
Chinese text normalization for speech processing
☆735 · Mar 18, 2023 · Updated 3 years ago
bootphon / phonemizer
Simple text to phones converter for multiple languages
☆1,558 · Sep 26, 2024 · Updated last year
cywang97 / StreamingTransformer
☆277 · Jan 15, 2021 · Updated 5 years ago
jaywalnut310 / glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
☆712 · Jul 12, 2022 · Updated 4 years ago
k2-fsa / k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,348 · Jul 11, 2026 · Updated 2 weeks ago
lmnt-com / wavegrad
A fast, high-quality neural vocoder.
☆299 · Jul 18, 2023 · Updated 3 years ago
NVIDIA / mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆869 · Jul 22, 2023 · Updated 3 years ago
facebookresearch / textlesslib
Library for Textless Spoken Language Processing
☆559 · Aug 29, 2023 · Updated 2 years ago
espnet / espnet
End-to-End Speech Processing Toolkit
☆9,903 · Updated this week
formiel / speech-translation
Multilingual speech translation
☆42 · Apr 15, 2021 · Updated 5 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
Using YouTube to prepare a speech recognition dataset for any language
☆10 · Mar 30, 2021 · Updated 5 years ago
iamyuanchung / Autoregressive-Predictive-Coding
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆191 · Jan 29, 2020 · Updated 6 years ago
Tomiinek / Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
☆844 · Oct 10, 2023 · Updated 2 years ago