Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"
☆32Jan 14, 2025Updated last year
Alternatives and similar repositories for speech-to-speech
Users that are interested in speech-to-speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently☆11Jun 6, 2024Updated 2 years ago
- Code for the AAAI 2023 Paper "Real or Fake Text?: Investigating Human Ability to Detect Boundaries Between Human-Written and Machine-Gene…☆16Oct 29, 2024Updated last year
- Code for "A Bilingual Generative Transformer for Semantic Sentence Embedding" published at EMNLP 2020.☆10Nov 20, 2020Updated 5 years ago
- Code for the ACL 2022 Paper "A Feasibility Study of Answer-Agnostic Question Generation for Education"☆16Jul 5, 2022Updated 3 years ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"☆13Nov 3, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.☆18Apr 21, 2025Updated last year
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆26Jul 2, 2024Updated last year
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"☆12Mar 31, 2022Updated 4 years ago
- DirectX Raytracing Path Tracer☆57Jun 23, 2022Updated 3 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- ☆35Sep 1, 2022Updated 3 years ago
- A Toolkit for a series of Young projects.☆23Apr 30, 2021Updated 5 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆109May 20, 2025Updated last year
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"☆37Dec 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆16Mar 26, 2022Updated 4 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jun 2, 2026Updated 2 weeks ago
- A hierarchical environment manager for bash, written in bash.☆17Apr 18, 2026Updated 2 months ago
- Generate SQUAD style dataset from raw text file and train a transformer based question answering model .This repo has code from https://g…☆13Aug 17, 2025Updated 10 months ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Aug 10, 2023Updated 2 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Mar 31, 2022Updated 4 years ago
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.☆78Oct 22, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Oct 19, 2022Updated 3 years ago
- A Neovim Jira plugin☆11Apr 20, 2023Updated 3 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)☆194Jun 7, 2026Updated last week
- Provides a wrapper to boot your penn-os on hardware!☆10Dec 7, 2017Updated 8 years ago
- CMU multilingual speech repository☆30Apr 15, 2022Updated 4 years ago
- A fast parallel implementation of RNN Transducer.☆12Apr 8, 2025Updated last year
- Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".☆12Jan 4, 2024Updated 2 years ago
- Official implementation of EMNLP 2023 Findings paper "Enhanced Simultaneous Machine Translation with Word-level Policies"☆18Apr 10, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A future hobby OS kernel☆11Nov 8, 2020Updated 5 years ago
- A system for creating experimental AAC prototypes.☆16May 4, 2026Updated last month
- Exploring aspects of similarity between spoken personal narratives by disentangling them into narrative clause types -- Supplementary inf…☆12Jul 14, 2020Updated 5 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- Order food via terminal.☆15Dec 29, 2020Updated 5 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆12Jun 12, 2023Updated 3 years ago
- ☆23Apr 3, 2025Updated last year