CoraJung / flexible-input-slu
This setup allows to train end-to-end neural models for spoken language understanding (SLU).
☆11Updated last year
Alternatives and similar repositories for flexible-input-slu:
Users that are interested in flexible-input-slu are comparing it to the libraries listed below
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated 2 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆20Updated 2 years ago
- End-to-end Speech Translation☆36Updated 3 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆18Updated 3 years ago
- ☆28Updated 4 years ago
- ☆33Updated 5 months ago
- ☆24Updated 4 years ago
- ☆16Updated 7 months ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆63Updated 2 years ago
- ☆28Updated 2 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Updated 2 years ago
- ☆33Updated 3 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25Updated last year
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Updated last year
- The case study and multilingfual performance of ICASSP submission☆20Updated 2 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆10Updated last year
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆27Updated last year
- ☆37Updated 4 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"☆12Updated 2 years ago
- BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing☆49Updated 10 months ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆22Updated last year
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".☆36Updated last year
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆33Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆41Updated 2 weeks ago
- ☆11Updated 2 years ago
- ☆16Updated 7 years ago
- ☆10Updated 4 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 3 years ago