dqqcasia / stView external linksLinks
End-to-end Speech Translation
☆35Apr 12, 2021Updated 4 years ago
Alternatives and similar repositories for st
Users that are interested in st are comparing it to the libraries listed below
Sorting:
- ☆27Aug 31, 2022Updated 3 years ago
- ☆179Nov 10, 2021Updated 4 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆19May 1, 2022Updated 3 years ago
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".☆36Oct 25, 2023Updated 2 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- Neural end-to-end Speech Translation Toolkit☆307Jun 28, 2022Updated 3 years ago
- Multilingual speech translation☆41Apr 15, 2021Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆48Feb 21, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆65May 25, 2022Updated 3 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆80Jan 9, 2025Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- CMU multilingual speech repository☆30Apr 15, 2022Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Tracking the progress in end-to-end speech translation☆261Oct 25, 2023Updated 2 years ago
- ASR & TTS joint training, asr, tts, machine speech chain☆16Oct 16, 2021Updated 4 years ago
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Sep 16, 2024Updated last year
- 单独维护的中文TTS☆34Oct 28, 2022Updated 3 years ago
- Scripts for training Kaldi for German speech recognition (ASR).☆26Feb 11, 2021Updated 5 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- REST api for mozilla deepspeech voice recognition engine☆20Nov 1, 2021Updated 4 years ago
- ☆35Sep 1, 2022Updated 3 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- Vim Speech Recognition Experiments☆20May 30, 2025Updated 8 months ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- A repository containing the code for speech translation papers.☆21Mar 11, 2022Updated 3 years ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆24Jun 12, 2023Updated 2 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago