CoraJung / flexible-input-slu

This setup allows training end-to-end neural models for spoken language understanding (SLU).

☆11

Alternatives and similar repositories for flexible-input-slu

Users who are interested in flexible-input-slu are comparing it to the libraries listed below.

MiuLab / SpokenVec
Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding
☆24 Updated 3 years ago
ReneeYe / ConST
Code for the paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
☆65 Updated 3 years ago
MiuLab / SpokenCSE
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding
☆11 Updated 2 years ago
hqsiswiliam / punctuation-restoration-scl
Token-Level Supervised Contrastive Learning for Punctuation Restoration
☆29 Updated 4 years ago
ictnlp / STEMM
Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
☆36 Updated 2 years ago
raotnameh / End-to-end-E2E-Named-Entity-Recognition-from-English-Speech
☆31 Updated 5 years ago
dqqcasia / st
End-to-end Speech Translation
☆35 Updated 4 years ago
thunlp / duplex-model
☆42 Updated last year
Alibaba-NLP / AISHELL-NER
[ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech
☆24 Updated 3 years ago
duyichao / E2E-ST-TDA
Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"
☆17 Updated 4 years ago
ictnlp / GMA
Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"
☆11 Updated 3 years ago
ReneeYe / XSTNet
This is an implementation of the paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech 2021)
☆19 Updated 3 years ago
MingLunHan / CIF-ColDec
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
☆25 Updated 2 years ago
pswietojanski / slurp
Repository for SLURP paper
☆107 Updated 3 years ago
ShiningLab / POS-Tagger-for-Punctuation-Restoration
This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…
☆11 Updated 2 years ago
zsLin177 / CopyNE
☆20 Updated last year
wutong8023 / SpeechRE
☆11 Updated 3 years ago
Chia-Hsuan-Lee / Spoken-SQuAD
A spoken question answering dataset based on SQuAD
☆50 Updated 7 months ago
cwang621 / blsp
BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing
☆57 Updated last year
shh1574 / multi-modal-dialogue-dataset
☆23 Updated 4 years ago
formiel / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆18 Updated last year
hfutami / distill-bert-for-seq2seq-asr
☆24 Updated 5 years ago
HLTCHKUST / CI-AVSR
Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.
☆40 Updated last year
XL2248 / MSCTD
Code and Data for the ACL22 main conference paper "MSCTD: A Multimodal Sentiment Chat Translation Dataset"
☆42 Updated last year
scir-zywang / self-training-self-supervised-disfluency
☆39 Updated 4 years ago
danliu2 / caat
☆35 Updated 3 years ago
LooperXX / ProSLU
Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"
☆35 Updated last year
dqqcasia / mosst
☆27 Updated 3 years ago
TRUMANCFY / SLIM
Code for "SLIM: Explicit Slot-Intent Mapping with BERT for Joint Multi-Intent Detection and Slot Filling"
☆18 Updated 3 years ago
facebookresearch / fbai-speech
Repo for the FB AI Speech team.
☆26 Updated 4 years ago