A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021
☆48Feb 21, 2022Updated 4 years ago
Alternatives and similar repositories for Chimera-ST
Users that are interested in Chimera-ST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".☆36Oct 25, 2023Updated 2 years ago
- Tracking the progress in end-to-end speech translation☆261Oct 25, 2023Updated 2 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆65May 25, 2022Updated 3 years ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus☆41Feb 10, 2022Updated 4 years ago
- Multilingual speech translation☆41Apr 15, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆179Nov 10, 2021Updated 4 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆19May 1, 2022Updated 3 years ago
- Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"☆17Oct 29, 2024Updated last year
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆19Nov 3, 2022Updated 3 years ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch☆13Feb 21, 2023Updated 3 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago
- A benchmark corpus for ASR hypothesis revising task☆21Sep 26, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆27Aug 31, 2022Updated 3 years ago
- End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding☆26Aug 12, 2021Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆14Nov 16, 2022Updated 3 years ago
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆28Jun 28, 2023Updated 2 years ago
- ☆35Sep 1, 2022Updated 3 years ago
- Data and Code for StructuredRegex.☆14Nov 16, 2023Updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆27Aug 11, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Dec 4, 2022Updated 3 years ago
- Pre-trained Wav2vec2.0 for Mandarin☆43Oct 30, 2022Updated 3 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- 🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed☆31Feb 5, 2022Updated 4 years ago
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…☆18Feb 22, 2024Updated 2 years ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆60Jun 7, 2024Updated last year
- Official repository for U-SAM (Interspeech 2025)☆26Jun 3, 2025Updated 9 months ago
- The repository for the paper: Rethinking Document-level Neural Machine Translation☆25Dec 20, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆21Mar 7, 2023Updated 3 years ago
- Zero -- A neural machine translation system☆152May 8, 2023Updated 2 years ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆14Mar 24, 2021Updated 5 years ago
- ☆21Sep 10, 2021Updated 4 years ago
- An end to end ASR Transformer model training repo☆13Dec 8, 2021Updated 4 years ago
- Code corresponding to our paper "A Graph-to-Sequence Model for AMR-to-Text Generation"☆137Mar 17, 2021Updated 5 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago