mgaido91 / FBK-fairseq-ST
A repository containing the code for speech translation papers.
☆22Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for FBK-fairseq-ST
- ☆34Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆19Updated 3 months ago
- End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding☆24Updated 3 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 4 months ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆37Updated last year
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆63Updated 8 months ago
- Systems submitted to IWSLT 2021 by the MT-UPC group.☆14Updated last year
- End-to-end Speech Translation☆36Updated 3 years ago
- ☆28Updated 2 years ago
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 2 years ago
- Repository for SLURP paper☆97Updated 2 years ago
- Tracking the progress in end-to-end speech translation☆254Updated last year
- Repo for the FB AI Speech team.☆22Updated 3 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25Updated last year
- A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset☆12Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- ☆175Updated 3 years ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆70Updated 3 years ago
- Multilingual speech translation☆41Updated 3 years ago
- Spoken Language Translation System☆19Updated 3 years ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus☆38Updated 2 years ago
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Updated last year
- ASCEND Chinese-English code-switching dataset☆22Updated 2 years ago
- Automatic Mapping of Disfluency Annotations for corrected version of Switchboard☆17Updated 5 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆62Updated 2 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆42Updated last year
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆32Updated 9 months ago
- ☆28Updated 3 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 3 years ago