minkjung / blankcollapse
☆9Updated last year
Related projects ⓘ
Alternatives and complementary repositories for blankcollapse
- Refactored version of https://github.com/ming024/FastSpeech2☆13Updated 3 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆8Updated 2 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago
- ☆24Updated 4 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- Balanced Error Rate for Speaker Diarization☆25Updated last year
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆30Updated 2 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆23Updated last year
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆14Updated 4 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 2 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆16Updated 8 months ago
- A pakage for crawling audio from Youtube☆41Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Updated 2 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆45Updated 3 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- ☆12Updated 3 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆13Updated last month
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Updated 6 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆44Updated last year
- PyTorch implementation of automatic speech recognition models.☆38Updated 3 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆36Updated 6 months ago
- A collection of papers related to speech model compression☆24Updated last year