End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding
☆26Aug 12, 2021Updated 4 years ago
Alternatives and similar repositories for SATE
Users that are interested in SATE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆12Apr 21, 2026Updated 2 weeks ago
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"☆37Dec 6, 2023Updated 2 years ago
- ☆14Nov 16, 2022Updated 3 years ago
- ☆35Sep 1, 2022Updated 3 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆47Feb 21, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆26Jul 2, 2024Updated last year
- Tracking the progress in end-to-end speech translation☆260Oct 25, 2023Updated 2 years ago
- The case study and multilingfual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆23Feb 13, 2023Updated 3 years ago
- Code for "A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation"(ACL2020)☆13Sep 14, 2021Updated 4 years ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆31Sep 6, 2024Updated last year
- ☆47Nov 1, 2025Updated 6 months ago
- Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>.☆24Jan 17, 2022Updated 4 years ago
- List of direct speech-to-speech translation papers.☆39Jan 31, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆28May 8, 2024Updated 2 years ago
- Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…☆14Apr 16, 2020Updated 6 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆110Mar 30, 2025Updated last year
- A fully and partially fake speech dataset for evaluation☆15Nov 11, 2025Updated 5 months ago
- ☆16Dec 23, 2021Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆26Mar 17, 2025Updated last year
- Fairseq tutorial☆18May 18, 2022Updated 3 years ago
- ☆86Dec 26, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".☆35Oct 25, 2023Updated 2 years ago
- ☆13Jul 13, 2022Updated 3 years ago
- A Fairseq implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆11Dec 21, 2020Updated 5 years ago
- STT Service based on Kaldi ASR☆15Aug 17, 2018Updated 7 years ago
- uyghur text resource crawled from website☆12Dec 25, 2015Updated 10 years ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Code for paper Sentence-aware Contrastive Learning for Open-Domain Passage Retrieval, Accepted by ACL2022 Main Conference, Long Paper☆30Mar 12, 2022Updated 4 years ago
- Simple Text Classification[WIP]☆11Dec 30, 2022Updated 3 years ago
- This is the repo of our work titled “Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception”☆33Mar 31, 2026Updated last month
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…☆12Aug 14, 2024Updated last year
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 9 years ago
- SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods☆27Aug 13, 2025Updated 8 months ago
- This repository implements our EMNLP 2022 research paper A Dataset for Hyper-Relational Extraction and a Cube-Filling Approach.☆28Dec 13, 2022Updated 3 years ago
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago