CoraJung / flexible-input-slu
This setup allows to train end-to-end neural models for spoken language understanding (SLU).
☆11Updated last year
Alternatives and similar repositories for flexible-input-slu:
Users that are interested in flexible-input-slu are comparing it to the libraries listed below
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated 2 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆20Updated 2 years ago
- End-to-end Speech Translation☆36Updated 4 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago
- ☆24Updated 4 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆18Updated 3 years ago
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Updated last year
- ☆28Updated 2 years ago
- ☆30Updated 4 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆10Updated last year
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆64Updated 2 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25Updated last year
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Updated 3 years ago
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".☆36Updated last year
- ☆38Updated 8 months ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch☆12Updated 2 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆47Updated 3 years ago
- ☆38Updated 4 years ago
- ☆59Updated 2 years ago
- ☆18Updated 10 months ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 4 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆19Updated 4 months ago
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"☆12Updated 3 years ago
- Implementation of meta-transfer-learning for ASR and LM (ACL 2020)☆50Updated 4 years ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆23Updated 3 years ago
- ☆34Updated 3 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆62Updated 3 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆13Updated last year
- Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.☆38Updated 9 months ago
- Unsupervised spoken sentence embeddings☆14Updated 2 years ago