Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"
☆32 · Jan 14, 2025 · Updated last year
Alternatives and similar repositories for speech-to-speech
Users that are interested in speech-to-speech are comparing it to the libraries listed below.
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently ☆11 · Jun 6, 2024 · Updated last year
- Code for the AAAI 2023 Paper "Real or Fake Text?: Investigating Human Ability to Detect Boundaries Between Human-Written and Machine-Gene… ☆16 · Oct 29, 2024 · Updated last year
- Code for "A Bilingual Generative Transformer for Semantic Sentence Embedding" published at EMNLP 2020. ☆10 · Nov 20, 2020 · Updated 5 years ago
- Code for the ACL 2022 Paper "A Feasibility Study of Answer-Agnostic Question Generation for Education" ☆16 · Jul 5, 2022 · Updated 3 years ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation" ☆13 · Nov 3, 2022 · Updated 3 years ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs. ☆18 · Apr 21, 2025 · Updated last year
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?". ☆26 · Jul 2, 2024 · Updated last year
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation" ☆12 · Mar 31, 2022 · Updated 4 years ago
- DirectX Raytracing Path Tracer ☆57 · Jun 23, 2022 · Updated 3 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses." ☆17 · Nov 30, 2021 · Updated 4 years ago
- ☆35 · Sep 1, 2022 · Updated 3 years ago
- A Toolkit for a series of Young projects. ☆23 · Apr 30, 2021 · Updated 5 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization ☆110 · May 20, 2025 · Updated 11 months ago
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation" ☆37 · Dec 6, 2023 · Updated 2 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation” ☆15 · Jan 3, 2025 · Updated last year
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch ☆17 · Mar 11, 2022 · Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ☆15 · Mar 26, 2022 · Updated 4 years ago
- Reasoning-based Evaluation and Ranking of Translations. ☆20 · Jul 18, 2025 · Updated 9 months ago
- A hierarchical environment manager for bash, written in bash. ☆17 · Apr 18, 2026 · Updated 3 weeks ago
- Generate SQUAD style dataset from raw text file and train a transformer based question answering model. This repo has code from https://g… ☆13 · Aug 17, 2025 · Updated 8 months ago
- A toolkit for any-to-any encoder-decoder voice conversion systems ☆84 · Aug 10, 2023 · Updated 2 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation" ☆11 · Mar 31, 2022 · Updated 4 years ago
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup. ☆78 · Oct 22, 2024 · Updated last year
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆60 · Oct 19, 2022 · Updated 3 years ago
- A Neovim Jira plugin ☆11 · Apr 20, 2023 · Updated 3 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation" ☆14 · Mar 1, 2023 · Updated 3 years ago
- CMU multilingual speech repository ☆30 · Apr 15, 2022 · Updated 4 years ago
- A fast parallel implementation of RNN Transducer. ☆12 · Apr 8, 2025 · Updated last year
- Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar". ☆12 · Jan 4, 2024 · Updated 2 years ago
- Official implementation of EMNLP 2023 Findings paper "Enhanced Simultaneous Machine Translation with Word-level Policies" ☆18 · Apr 10, 2026 · Updated 3 weeks ago
- A future hobby OS kernel ☆11 · Nov 8, 2020 · Updated 5 years ago
- Exploring aspects of similarity between spoken personal narratives by disentangling them into narrative clause types -- Supplementary inf… ☆12 · Jul 14, 2020 · Updated 5 years ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ… ☆31 · Sep 20, 2025 · Updated 7 months ago
- Order food via terminal. ☆15 · Dec 29, 2020 · Updated 5 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU). ☆12 · Jun 12, 2023 · Updated 2 years ago
- ☆23 · Apr 3, 2025 · Updated last year
- Implementation of DiffWave and SaShiMi audio generation models ☆128 · Apr 4, 2023 · Updated 3 years ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Grap… ☆13 · Oct 30, 2024 · Updated last year
- Sing any popular song with your voice ☆11 · Jul 10, 2022 · Updated 3 years ago