khanld / ASR-Wav2vec-FinetuneView external linksLinks
Finetune Wa2vec 2.0 For Speech Recognition
☆145Feb 6, 2025Updated last year
Alternatives and similar repositories for ASR-Wav2vec-Finetune
Users that are interested in ASR-Wav2vec-Finetune are comparing it to the libraries listed below
Sorting:
- Wav2vec 2.0 Self-Supervised Pretraining☆58Feb 6, 2025Updated last year
- ASR: fine-tune wav2vec 2.0 with transformers☆21Sep 13, 2021Updated 4 years ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 3 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Jan 1, 2025Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated last month
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Oct 23, 2024Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 10 months ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆381Nov 22, 2021Updated 4 years ago
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆116Jan 26, 2024Updated 2 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆28Jan 28, 2026Updated 2 weeks ago
- SSL Layerwise analysis for speech deepfake detection☆32Aug 5, 2025Updated 6 months ago
- ☆68Dec 30, 2025Updated last month
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆18May 17, 2023Updated 2 years ago
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆274Apr 2, 2022Updated 3 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Oct 26, 2021Updated 4 years ago
- ☆25Jul 20, 2021Updated 4 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- ☆18Mar 4, 2023Updated 2 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Updated this week
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago
- ☆47Aug 31, 2024Updated last year
- [INTERSPEECH'24] Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection☆54Dec 4, 2024Updated last year
- VietASR - Vietnamese Automatic Speech Recognition☆163Oct 29, 2024Updated last year
- ☆16Dec 23, 2021Updated 4 years ago
- ☆15Aug 22, 2025Updated 5 months ago
- ☆34Jun 9, 2025Updated 8 months ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- ☆14Jul 24, 2025Updated 6 months ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆102Jun 21, 2024Updated last year