ictnlp/ComSpeech

ictnlp / ComSpeech

Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".

☆27

Alternatives and similar repositories for ComSpeech

Users who are interested in ComSpeech are comparing it to the libraries listed below.

ictnlp / DST
DST is a decoder-only simultaneous machine translation model that can make policy decisions and translate concurrently.
☆11 • Jun 6, 2024 • Updated 2 years ago
ictnlp / NAST-S2x
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
☆78 • Oct 22, 2024 • Updated last year
ictnlp / BT4ST
Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".
☆11 • Oct 25, 2023 • Updated 2 years ago
nethermanpro / ComSL
☆11 • Oct 14, 2023 • Updated 2 years ago
ictnlp / FastLongSpeech
FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech process…
☆16 • Jul 22, 2025 • Updated last year
ictnlp / CMOT
Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation".
☆17 • Oct 29, 2024 • Updated last year
ictnlp / Dual-Path
Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation".
☆12 • Mar 31, 2022 • Updated 4 years ago
ictnlp / ITST
Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation".
☆13 • Nov 3, 2022 • Updated 3 years ago
ictnlp / DiSeg
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation".
☆37 • Dec 6, 2023 • Updated 2 years ago
ictnlp / StreamUni
StreamUni is a framework that efficiently enables unified Large Speech-Language Models to accomplish streaming speech translation in a co…
☆22 • Jul 14, 2025 • Updated last year
ictnlp / LSG
Code for AAAI 2025 paper “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”.
☆15 • Jan 3, 2025 • Updated last year
ictnlp / SLED-TTS
Streamable Text-to-Speech model using a language modeling approach, without vector quantization.
☆108 • May 20, 2025 • Updated last year
ictnlp / LNMT-CA
Code for EMNLP 2022 main conference paper "Low-resource Neural Machine Translation with Cross-modal Alignment".
☆15 • Apr 25, 2023 • Updated 3 years ago
ictnlp / CRESS
Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
☆16 • Oct 25, 2023 • Updated 2 years ago
ictnlp / SiLLM
SiLLM is a Simultaneous Machine Translation (SiMT) framework. It utilizes a Large Language Model as the translation model and employs a t…
☆18 • Feb 22, 2024 • Updated 2 years ago
ictnlp / GMA
Code for ACL 2022 Findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation".
☆11 • Mar 31, 2022 • Updated 4 years ago
P1ping / TokAN-Legacy
☆27 • Jun 22, 2026 • Updated last month
kaistmm / AdaptVC
☆17 • Jun 2, 2025 • Updated last year
Jason-Young-AI / YoungToolkit
A Toolkit for a series of Young projects.
☆23 • Apr 30, 2021 • Updated 5 years ago
ictnlp / FA-DAT
Official implementation for the ICLR 2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation".
☆14 • Mar 1, 2023 • Updated 3 years ago
shinhyeokoh / rwen
☆14 • Jun 16, 2023 • Updated 3 years ago
ictnlp / PCFG-NAT
Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".
☆12 • Jan 4, 2024 • Updated 2 years ago
rishikksh20 / MiniMax-TTS-pytorch
An attempt to replicate the architecture of MiniMaxTTS described in its technical report.
☆47 • Sep 2, 2025 • Updated 10 months ago
sarulab-speech / spatial_voice_conversion
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
☆18 • Aug 8, 2024 • Updated last year
ZehuaKcrissLi / GTR-Voice
☆16 • Nov 11, 2024 • Updated last year
ictnlp / STEMM
Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
☆35 • Oct 25, 2023 • Updated 2 years ago
liamdugan / speech-to-speech
Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation…
☆32 • Jan 14, 2025 • Updated last year
choiHkk / Transformer-TTS-V2
☆25 • Mar 6, 2024 • Updated 2 years ago
ReneeYe / ConST
Code for NAACL 2022 paper "Cross-modal Contrastive Learning for Speech Translation".
☆64 • May 25, 2022 • Updated 4 years ago
AMAAI-Lab / DART
Demo for DART, a submission to the Audio Imagination workshop at NeurIPS 2024.
☆16 • Apr 22, 2026 • Updated 3 months ago
ictnlp / NMLA-NAT
Code for NeurIPS 2022 Spotlight paper "Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation".
☆20 • Nov 16, 2022 • Updated 3 years ago
openaudiolab / LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆26 • Aug 11, 2024 • Updated last year
ictnlp / TLAT-NMT
Source code for the EMNLP 2020 long paper "Token-level Adaptive Training for Neural Machine Translation".
☆20 • Oct 28, 2022 • Updated 3 years ago
tabahi / contexless-phonemes-CUPE
PyTorch model for contexless-phoneme prediction from speech audio.
☆32 • Oct 30, 2025 • Updated 8 months ago
ictnlp / DASpeech
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
☆63 • Jul 22, 2024 • Updated 2 years ago
titu1994 / warprnnt_numba
WarpRNNT loss ported to Numba (CPU/CUDA) for PyTorch.
☆17 • Mar 11, 2022 • Updated 4 years ago
warisqr007 / ppg2ppg
Zero-Shot Foreign Accent Conversion without a Native Reference
☆36 • May 1, 2024 • Updated 2 years ago
BayLing-Models / BayLing-Duplex
Native full-duplex speech dialogue inference for BayLing-Duplex.
☆63 • Jun 22, 2026 • Updated last month
Dapwner / CVAE-Tacotron
☆26 • Jun 5, 2024 • Updated 2 years ago