jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Semi-Supervsied-Spoken-Language-Understanding-PyTorch
- ☆24Updated 4 years ago
- Unsupervised spoken sentence embeddings☆14Updated last year
- ☆36Updated 2 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆33Updated last year
- Refactored version of https://github.com/ming024/FastSpeech2☆13Updated 3 years ago
- ☆9Updated last year
- ☆16Updated 2 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆30Updated 2 years ago
- Temporary anonymous version☆22Updated 8 months ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆16Updated 8 months ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Updated 2 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago
- ☆25Updated 2 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- ☆13Updated last month
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- ☆31Updated last year
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆44Updated last year
- ☆37Updated 3 years ago
- ☆20Updated 3 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆20Updated 3 months ago
- ☆22Updated 5 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago