This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)
☆19May 1, 2022Updated 3 years ago
Alternatives and similar repositories for XSTNet
Users that are interested in XSTNet are comparing it to the libraries listed below
Sorting:
- ☆11Oct 14, 2023Updated 2 years ago
- ☆15Oct 19, 2021Updated 4 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆65May 25, 2022Updated 3 years ago
- ASR & TTS joint training, asr, tts, machine speech chain☆16Oct 16, 2021Updated 4 years ago
- Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"☆17Oct 29, 2024Updated last year
- Multilingual speech translation☆41Apr 15, 2021Updated 4 years ago
- code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translation☆18Oct 19, 2022Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆18Dec 20, 2024Updated last year
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆48Feb 21, 2022Updated 4 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago
- Code for our work "MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators" in ACL 2022☆20Mar 18, 2022Updated 3 years ago
- Official release of StyleTalk dataset.☆72Jul 1, 2024Updated last year
- ☆179Nov 10, 2021Updated 4 years ago
- A list of resources that can help in research for automated audio captioning☆34Feb 17, 2021Updated 5 years ago
- Neural end-to-end Speech Translation Toolkit☆307Jun 28, 2022Updated 3 years ago
- Template and steps to build your personal blog using Jekyll and Minimal Mistake☆10Feb 24, 2020Updated 6 years ago
- Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding☆11May 23, 2024Updated last year
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆12Aug 1, 2025Updated 7 months ago
- This repo contains the code for paper "nuCarla: A nuScenes-Style Bird’s-Eye View Perception Dataset for CARLA Simulation"☆31Jan 2, 2026Updated last month
- A zoneout implemetion based on pytorch☆10Jan 22, 2019Updated 7 years ago
- Implementation of the attention-sum reader using tensorflow and keras.☆11Aug 1, 2017Updated 8 years ago
- ☆11Apr 26, 2021Updated 4 years ago
- This repository is created on top of two repositories i.e., yolov7 face detection and yolov7 blurring object☆15Jan 21, 2023Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 5 years ago
- ☆10Apr 3, 2024Updated last year
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- ☆12May 22, 2022Updated 3 years ago
- Deep Learning Framework on top of Theano☆10Jan 15, 2018Updated 8 years ago
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Oct 12, 2020Updated 5 years ago
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- ☆10Sep 7, 2022Updated 3 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- ☆11Nov 14, 2021Updated 4 years ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Nov 12, 2020Updated 5 years ago
- ☆10Jan 11, 2023Updated 3 years ago
- This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…☆43Sep 16, 2022Updated 3 years ago
- [ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents☆16Apr 4, 2024Updated last year
- ☆10Nov 15, 2020Updated 5 years ago