Finetune Wa2vec 2.0 For Speech Recognition
☆149Feb 6, 2025Updated last year
Alternatives and similar repositories for ASR-Wav2vec-Finetune
Users that are interested in ASR-Wav2vec-Finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Wav2vec 2.0 Self-Supervised Pretraining☆59Feb 6, 2025Updated last year
- ASR: fine-tune wav2vec 2.0 with transformers☆21Sep 13, 2021Updated 4 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆67Jan 1, 2025Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆379Nov 22, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆273Apr 2, 2022Updated 3 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated last year
- poorman's ar-dit tts☆45Dec 31, 2025Updated 2 months ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆18May 17, 2023Updated 2 years ago
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription☆78Feb 13, 2026Updated last month
- Pre-trained Wav2vec2.0 for Mandarin☆43Oct 30, 2022Updated 3 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Oct 26, 2021Updated 4 years ago
- 语音识别 论文 前沿☆52Jan 8, 2022Updated 4 years ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆61Oct 23, 2024Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆361May 23, 2023Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0☆53Apr 23, 2022Updated 3 years ago
- ☆80Aug 8, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- VietASR - Vietnamese Automatic Speech Recognition☆165Mar 19, 2026Updated last week
- ☆68Dec 30, 2025Updated 2 months ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- ☆15Oct 8, 2023Updated 2 years ago
- This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand☆38May 23, 2023Updated 2 years ago
- ☆15Aug 22, 2025Updated 7 months ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 5 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆107Jun 21, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- a simplified version of wav2vec(1.0, vq, 2.0) in fairseq☆170Sep 21, 2020Updated 5 years ago
- wav2vec2 asr with transformers☆16Oct 26, 2021Updated 4 years ago
- ☆25Jul 20, 2021Updated 4 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆117Jan 26, 2024Updated 2 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 6 months ago
- ☆32Dec 4, 2022Updated 3 years ago