Tracking the progress in end-to-end speech translation
☆261 · Oct 25, 2023 · Updated 2 years ago
Alternatives and similar repositories for SpeechTransProgress
Users who are interested in SpeechTransProgress are comparing it to the repositories listed below.
- ☆179 · Nov 10, 2021 · Updated 4 years ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed) ☆396 · Sep 14, 2021 · Updated 4 years ago
- Neural end-to-end Speech Translation Toolkit ☆307 · Jun 28, 2022 · Updated 3 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021 ☆48 · Feb 21, 2022 · Updated 4 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus ☆220 · Aug 26, 2022 · Updated 3 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022) ☆65 · May 25, 2022 · Updated 3 years ago
- ☆15 · Jun 17, 2019 · Updated 6 years ago
- PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation ☆178 · Jun 20, 2024 · Updated last year
- Official code for Wav2Seq ☆97 · Jul 19, 2022 · Updated 3 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.) ☆302 · Mar 15, 2023 · Updated 2 years ago
- End-to-end Speech Translation ☆35 · Apr 12, 2021 · Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Apr 13, 2022 · Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Jun 2, 2023 · Updated 2 years ago
- ☆25 · Feb 12, 2023 · Updated 3 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models ☆33 · Sep 2, 2022 · Updated 3 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆565 · Apr 2, 2023 · Updated 2 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S… ☆415 · Aug 29, 2023 · Updated 2 years ago
- Library for Textless Spoken Language Processing ☆555 · Aug 29, 2023 · Updated 2 years ago
- ☆16 · Dec 23, 2021 · Updated 4 years ago
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings) ☆28 · Jun 28, 2023 · Updated 2 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning ☆48 · Nov 8, 2023 · Updated 2 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN ☆12 · Oct 22, 2020 · Updated 5 years ago
- A library for speech data augmentation in time-domain ☆683 · Aug 30, 2021 · Updated 4 years ago
- List of direct speech-to-speech translation papers. ☆38 · Jan 31, 2023 · Updated 3 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models ☆27 · Aug 11, 2024 · Updated last year
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi ☆344 · Dec 25, 2020 · Updated 5 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ☆368 · Oct 12, 2021 · Updated 4 years ago
- An adaptation of Fairseq to (End-to-end) speech translation. ☆22 · Jun 1, 2022 · Updated 3 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 ☆252 · Jun 5, 2025 · Updated 9 months ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus ☆41 · Feb 10, 2022 · Updated 4 years ago
- End-to-end ASR/LM implementation with PyTorch ☆594 · Aug 30, 2021 · Updated 4 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts". ☆12 · Oct 25, 2023 · Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation ☆12 · Jan 27, 2023 · Updated 3 years ago
- A library of speech gadgets. ☆14 · Oct 15, 2022 · Updated 3 years ago
- ☆13 · Sep 25, 2024 · Updated last year
- ☆276 · Jan 15, 2021 · Updated 5 years ago
- ☆55 · Jan 13, 2023 · Updated 3 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,386 · Jun 6, 2024 · Updated last year
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning ☆191 · Jan 29, 2020 · Updated 6 years ago