xuchenneu/SATE

xuchenneu / SATE

End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding

☆26

Alternatives and similar repositories for SATE

Users who are interested in SATE are comparing it to the repositories listed below.

ictnlp / DiSeg
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
☆37 · Dec 6, 2023 · Updated 2 years ago
ustctf-zz / delibnet
☆14 · Nov 16, 2022 · Updated 3 years ago
danliu2 / caat
☆35 · Sep 1, 2022 · Updated 3 years ago
Glaciohound / Chimera-ST
A PyTorch implementation of the paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021
☆47 · Feb 21, 2022 · Updated 4 years ago
ictnlp / ComSpeech
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
☆27 · Jul 2, 2024 · Updated 2 years ago
kahne / SpeechTransProgress
Tracking the progress in end-to-end speech translation
☆260 · Oct 25, 2023 · Updated 2 years ago
dqqcasia / st
End-to-end Speech Translation
☆35 · Apr 12, 2021 · Updated 5 years ago
Yaoming95 / UniPunc
The case study and multilingual performance of ICASSP submission
☆24 · Sep 24, 2022 · Updated 3 years ago
zorazrw / multilingual-conala
[EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
☆23 · Feb 13, 2023 · Updated 3 years ago
sarulab-speech / spatial_voice_conversion
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
☆18 · Aug 8, 2024 · Updated last year
Chen-GX / ReForm
☆21 · Jan 31, 2026 · Updated 5 months ago
choijeongsoo / utut
[TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation
☆31 · Sep 6, 2024 · Updated last year
nethermanpro / ComSL
☆11 · Oct 14, 2023 · Updated 2 years ago
Rongjiehuang / awesome-speech-to-speech-translation
List of direct speech-to-speech translation papers.
☆39 · Jan 31, 2023 · Updated 3 years ago
alecokas / BiLatticeRNN-Confidence
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…
☆14 · Apr 16, 2020 · Updated 6 years ago
MARIO-Math-Reasoning / MARIO
☆28 · May 8, 2024 · Updated 2 years ago
Diamondfan / cassnat_asr
Implementation of CTC alignment-based single step non-autoregressive transformer
☆13 · Jun 2, 2023 · Updated 3 years ago
hlt-mt / FBK-fairseq
Repository containing the open source code of works published at the FBK MT unit.
☆60 · Mar 19, 2026 · Updated 4 months ago
syuqings / Fashion-MMT
Dataset and codes for the paper "Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training".
☆25 · Mar 6, 2022 · Updated 4 years ago
bzhangGo / sltunet
SLTUNET: A Simple Unified Model for Sign Language Translation (ICLR 2023)
☆39 · Jul 10, 2023 · Updated 3 years ago
cwang621 / blsp-emo
BLSP-Emo: Towards Empathetic Large Speech-Language Models
☆62 · Jun 7, 2024 · Updated 2 years ago
zja-nlp / NAT_with_DAD
☆10 · Mar 28, 2022 · Updated 4 years ago
kate-egorova / ASR-hybrid-decoding
An extension of the Kaldi speech recognition software that allows decoding speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
ictnlp / STEMM
Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
☆35 · Oct 25, 2023 · Updated 2 years ago
Saltychtao / fairseq-tutorial
☆13 · Jul 13, 2022 · Updated 4 years ago
sooftware / Fairseq-Listen-Attend-Spell
A Fairseq implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
☆11 · Dec 21, 2020 · Updated 5 years ago
Unbabel / smaug
Python package to augment multilingual data
☆15 · Feb 15, 2023 · Updated 3 years ago
zhengxxn / adaptive-knn-mt
☆86 · Dec 26, 2022 · Updated 3 years ago
chengzhipanpan / DCSR
Code for the paper "Sentence-aware Contrastive Learning for Open-Domain Passage Retrieval", accepted at the ACL 2022 main conference (long paper)
☆30 · Mar 12, 2022 · Updated 4 years ago
azmat21 / UyghurTextResource
Uyghur text resources crawled from websites
☆12 · Dec 25, 2015 · Updated 10 years ago
yhytoto12 / Behavior-SD
Official Implementation of NAACL 2025 Paper: Behavior-SD: Behaviorally Aware Spoken Dialogue Generation with Large Language Models
☆18 · Apr 30, 2025 · Updated last year
TianchunH97 / fairseq-rl
Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.
☆11 · Aug 14, 2019 · Updated 6 years ago
lt3 / nfr
Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…
☆12 · Aug 14, 2024 · Updated last year
MLSpeech / DeepPhoneticToolsTutorial
Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15
☆12 · Apr 17, 2017 · Updated 9 years ago
jfainberg / lattice_combination
Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices
☆16 · Mar 19, 2024 · Updated 2 years ago
huiwy / reflection-on-trees
☆14 · May 9, 2024 · Updated 2 years ago
LCF2764 / autoKWS2021_1st_solution
Auto-KWS 2021 Challenge 1st place solution.
☆11 · Jul 20, 2021 · Updated 5 years ago
bigganbing / Fairseq_MorphTE
[NeurIPS 2022] MorphTE: Injecting Morphology in Tensorized Embeddings
☆17 · Oct 29, 2022 · Updated 3 years ago
backspacetg / simul_whisper
Code for our INTERSPEECH paper "Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection"
☆112 · Mar 30, 2025 · Updated last year