A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021
☆47Feb 21, 2022Updated 4 years ago
Alternatives and similar repositories for Chimera-ST
Users that are interested in Chimera-ST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".☆36Oct 25, 2023Updated 2 years ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 5 years ago
- ☆11Oct 14, 2023Updated 2 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆65May 25, 2022Updated 3 years ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus☆41Feb 10, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Multilingual speech translation☆41Apr 15, 2021Updated 5 years ago
- ☆179Nov 10, 2021Updated 4 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆19May 1, 2022Updated 3 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆19Nov 3, 2022Updated 3 years ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch☆13Feb 21, 2023Updated 3 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆26Jul 2, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆27Aug 31, 2022Updated 3 years ago
- A light webserver for monitoring RAM and GPU usage on multiple servers.☆21Mar 31, 2021Updated 5 years ago
- End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding☆26Aug 12, 2021Updated 4 years ago
- Neural end-to-end Speech Translation Toolkit☆306Jun 28, 2022Updated 3 years ago
- Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".☆17Oct 25, 2023Updated 2 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆14Nov 16, 2022Updated 3 years ago
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆27Jun 28, 2023Updated 2 years ago
- SLTUNET: A Simple Unified Model for Sign Language Translation (ICLR 2023)☆37Jul 10, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆35Sep 1, 2022Updated 3 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30May 31, 2022Updated 3 years ago
- Data and Code for StructuredRegex.☆14Nov 16, 2023Updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- Question and answer retrieval in Turkish with BERT☆14Nov 30, 2021Updated 4 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Dec 4, 2022Updated 3 years ago
- Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.☆20Oct 28, 2022Updated 3 years ago
- Pre-trained Wav2vec2.0 for Mandarin☆43Oct 30, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- 🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed☆31Feb 5, 2022Updated 4 years ago
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…☆18Feb 22, 2024Updated 2 years ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆61Jun 7, 2024Updated last year
- Official repository for U-SAM (Interspeech 2025)☆27Jun 3, 2025Updated 10 months ago
- The repository for the paper: Rethinking Document-level Neural Machine Translation☆25Dec 20, 2022Updated 3 years ago
- ☆21Mar 7, 2023Updated 3 years ago