Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
☆35Oct 25, 2023Updated 2 years ago
Alternatives and similar repositories for STEMM
Users that are interested in STEMM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".☆16Oct 25, 2023Updated 2 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆64May 25, 2022Updated 4 years ago
- Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"☆17Oct 29, 2024Updated last year
- Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".☆21Oct 25, 2023Updated 2 years ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for EMNLP 2022 main conference paper "Low-resource Neural Machine Translation with Cross-modal Alignment".☆15Apr 25, 2023Updated 3 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆11Oct 25, 2023Updated 2 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆18May 1, 2022Updated 4 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆20May 13, 2026Updated 3 weeks ago
- ☆179Nov 10, 2021Updated 4 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆26Aug 11, 2024Updated last year
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆27Jun 28, 2023Updated 2 years ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆26Jul 2, 2024Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Neural end-to-end Speech Translation Toolkit☆307Jun 28, 2022Updated 3 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".☆12Jan 4, 2024Updated 2 years ago
- Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"☆28Feb 19, 2023Updated 3 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- Tracking the progress in end-to-end speech translation☆260Oct 25, 2023Updated 2 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆63Jul 22, 2024Updated last year
- A repository containing the code for speech translation papers.☆21Mar 11, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- “百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型,具有优越的英语/中文能力,在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced l…☆315Dec 3, 2024Updated last year
- This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…☆44Sep 16, 2022Updated 3 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Sep 22, 2021Updated 4 years ago
- ☆30Nov 3, 2020Updated 5 years ago
- This repository contains the official implementation code of the paper Transformer-based Feature Reconstruction Network for Robust Multim…☆43Apr 16, 2023Updated 3 years ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆28May 30, 2025Updated last year
- ☆21Oct 13, 2021Updated 4 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023).☆14Nov 22, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Paradigm shift in natural language processing☆42May 29, 2022Updated 4 years ago
- Multilingual speech translation☆42Apr 15, 2021Updated 5 years ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆31Sep 6, 2024Updated last year
- ☆53Dec 6, 2021Updated 4 years ago
- ☆13Mar 11, 2024Updated 2 years ago
- ☆12Apr 18, 2025Updated last year
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago