CoraJung / flexible-input-slu
This setup allows training end-to-end neural models for spoken language understanding (SLU).
☆11Updated last year
Related projects: ⓘ
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated last year
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆30Updated 3 years ago
- End-to-end Speech Translation☆35Updated 3 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆9Updated last year
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆23Updated last year
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆19Updated 2 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆61Updated 2 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆18Updated 2 years ago
- BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing☆42Updated 6 months ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆13Updated last year
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Updated last year
- A spoken question answering dataset on SQUAD☆38Updated last year
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆21Updated 2 years ago
- Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.☆37Updated 2 months ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 3 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Updated 2 years ago
- Repo for the FB AI Speech team.☆22Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆18Updated last month
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 2 years ago