kahne/SpeechTransProgress

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kahne/SpeechTransProgress)

kahne / SpeechTransProgress

Tracking the progress in end-to-end speech translation

☆260

Alternatives and similar repositories for SpeechTransProgress

Users that are interested in SpeechTransProgress are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dqqcasia / awesome-speech-translation
View on GitHub
☆178Nov 10, 2021Updated 4 years ago
facebookresearch / covost
View on GitHub
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)
☆401Sep 14, 2021Updated 4 years ago
bytedance / neurst
View on GitHub
Neural end-to-end Speech Translation Toolkit
☆306Jun 28, 2022Updated 4 years ago
Glaciohound / Chimera-ST
View on GitHub
A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021
☆47Feb 21, 2022Updated 4 years ago
ReneeYe / ConST
View on GitHub
code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
☆64May 25, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
google-research-datasets / cvss
View on GitHub
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
☆220Aug 26, 2022Updated 3 years ago
isl-mt / fluent-fisher
View on GitHub
☆15Jun 17, 2019Updated 7 years ago
dqqcasia / st
View on GitHub
End-to-end Speech Translation
☆35Apr 12, 2021Updated 5 years ago
kahne / NonAutoregGenProgress
View on GitHub
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
☆300Mar 15, 2023Updated 3 years ago
Rongjiehuang / TranSpeech
View on GitHub
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
☆183Jun 20, 2024Updated 2 years ago
Rongjiehuang / awesome-speech-to-speech-translation
View on GitHub
List of direct speech-to-speech translation papers.
☆39Jan 31, 2023Updated 3 years ago
facebookresearch / voxpopuli
View on GitHub
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
☆574Apr 2, 2023Updated 3 years ago
fengpeng-yue / speech-to-speech-translation
View on GitHub
☆25Feb 12, 2023Updated 3 years ago
asappresearch / wav2seq
View on GitHub
Official code for Wav2Seq
☆97Jul 19, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ictnlp / BT4ST
View on GitHub
Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".
☆11Oct 25, 2023Updated 2 years ago
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
mattiadg / FBK-Fairseq-ST
View on GitHub
An adaptation of Fairseq to (End-to-end) speech translation.
☆22Jun 1, 2022Updated 4 years ago
eastonYi / Unsupervised-ASR
View on GitHub
unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12Oct 22, 2020Updated 5 years ago
ictnlp / STEMM
View on GitHub
Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
☆35Oct 25, 2023Updated 2 years ago
mct10 / CoBERT
View on GitHub
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆48Nov 8, 2023Updated 2 years ago
0nutation / DUB
View on GitHub
Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)
☆28Jun 28, 2023Updated 3 years ago
ictnlp / CMOT
View on GitHub
Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"
☆17Oct 29, 2024Updated last year
ashi-ta / speechGLUE
View on GitHub
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Jun 2, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
openaudiolab / LLaST
View on GitHub
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆26Aug 11, 2024Updated last year
facebookresearch / textlesslib
View on GitHub
Library for Textless Spoken Language Processing
☆559Aug 29, 2023Updated 2 years ago
danliu2 / caat
View on GitHub
☆35Sep 1, 2022Updated 3 years ago
facebookresearch / speech-resynthesis
View on GitHub
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…
☆416Aug 29, 2023Updated 2 years ago
ttslr / MonTTS
View on GitHub
☆16Dec 23, 2021Updated 4 years ago
joshua-decoder / fisher-callhome-corpus
View on GitHub
The Fisher and CALLHOME Spanish–English Speech Translation Corpus
☆41Feb 10, 2022Updated 4 years ago
ictnlp / DiSeg
View on GitHub
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
☆37Dec 6, 2023Updated 2 years ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
isl-mt / SLT.KIT
View on GitHub
Spoken Language Translation System
☆20Jul 26, 2021Updated 5 years ago
formiel / speech-translation
View on GitHub
Multilingual speech translation
☆42Apr 15, 2021Updated 5 years ago
nethermanpro / ComSL
View on GitHub
☆11Oct 14, 2023Updated 2 years ago
tencent-ailab / pika
View on GitHub
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
☆354Dec 25, 2020Updated 5 years ago
xuchenneu / SATE
View on GitHub
End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding
☆26Aug 12, 2021Updated 4 years ago
ictnlp / CRESS
View on GitHub
Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
☆16Oct 25, 2023Updated 2 years ago
mgaido91 / FBK-fairseq-ST
View on GitHub
A repository containing the code for speech translation papers.
☆21Mar 11, 2022Updated 4 years ago