khanld/ASR-Wav2vec-Finetune

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/khanld/ASR-Wav2vec-Finetune)

khanld / ASR-Wav2vec-Finetune

Finetune Wa2vec 2.0 For Speech Recognition

☆150

Alternatives and similar repositories for ASR-Wav2vec-Finetune

Users that are interested in ASR-Wav2vec-Finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

khanld / Wav2vec2-Pretraining
View on GitHub
Wav2vec 2.0 Self-Supervised Pretraining
☆62Feb 6, 2025Updated last year
qinyuenlp / wav2vec_finetune
View on GitHub
ASR: fine-tune wav2vec 2.0 with transformers
☆21Sep 13, 2021Updated 4 years ago
v-nhandt21 / Vinorm
View on GitHub
Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…
☆67Jan 1, 2025Updated last year
mailong25 / self-supervised-speech-recognition
View on GitHub
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380Nov 22, 2021Updated 4 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ZQuang2202 / Zipformer_Lightning
View on GitHub
An upgrade framework for train and validate compare with icefall using Lightning.
☆16Mar 26, 2025Updated last year
vietai / ASR
View on GitHub
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
☆106Sep 3, 2021Updated 4 years ago
heraclex12 / vietpunc
View on GitHub
Vietnamese Punctuation Prediction using Pretrained Language Models
☆14May 8, 2022Updated 4 years ago
m3hrdadfi / soxan
View on GitHub
Wav2Vec for speech recognition, classification, and audio classification
☆276Apr 2, 2022Updated 4 years ago
lifeiteng / Aligner-SUPERB
View on GitHub
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
☆39May 7, 2025Updated last year
yzyouzhang / SASV_PR
View on GitHub
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
☆18Jun 24, 2022Updated 4 years ago
zelaki / DisfluentFA
View on GitHub
A Weakly Supervised Forced Alignment for disluent speech
☆15Nov 12, 2023Updated 2 years ago
kehanlu / Mandarin-Wav2Vec2
View on GitHub
Pre-trained Wav2vec2.0 for Mandarin
☆43Oct 30, 2022Updated 3 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
b04901014 / FT-w2v2-ser
View on GitHub
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
☆153Oct 26, 2021Updated 4 years ago
TowerYsable / ASR_awesome
View on GitHub
语音识别论文前沿
☆53Jan 8, 2022Updated 4 years ago
AI-S2-Lab / FluentEditor
View on GitHub
[InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆62Oct 23, 2024Updated last year
vasistalodagala / whisper-finetune
View on GitHub
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
☆365May 23, 2023Updated 3 years ago
Edresson / Wav2Vec-Wrapper
View on GitHub
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆80May 20, 2023Updated 3 years ago
ramizasr21 / comptia-network-n10-009-dumps
View on GitHub
Skillcertpro dumps Priced reasonably at around $20, these resources offer excellent value considering the quality, lifetime access, and u…
☆10Jul 18, 2024Updated 2 years ago
manhph2211 / ViSR
View on GitHub
This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand
☆39May 23, 2023Updated 3 years ago
vistec-AI / wav2vec2-large-xlsr-53-th
View on GitHub
Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0
☆53Apr 23, 2022Updated 4 years ago
Bartelds / asr-augmentation
View on GitHub
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation
☆18May 17, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
BUTSpeechFIT / DeCRED
View on GitHub
☆18Aug 13, 2025Updated 11 months ago
JazminVidal / gop-pykaldi
View on GitHub
Goodness of Pronunciation algorithm using PyKaldi
☆19Jun 12, 2022Updated 4 years ago
spring-media / DeepForcedAligner
View on GitHub
☆81Aug 8, 2025Updated 11 months ago
duongttr / mllib-from-scratch
View on GitHub
Building a Machine Learning Library from scratch using Python3, based on SOTA library Scikit-learn
☆15Jan 20, 2023Updated 3 years ago
dangvansam / viet-asr
View on GitHub
VietASR - Vietnamese Automatic Speech Recognition
☆171Jun 18, 2026Updated last month
aalto-speech / interspeech2019_karhila_et_al
View on GitHub
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…
☆25May 6, 2019Updated 7 years ago
eastonYi / wav2vec
View on GitHub
a simplified version of wav2vec(1.0, vq, 2.0) in fairseq
☆170Sep 21, 2020Updated 5 years ago
aws-samples / amazon-sagemaker-fine-tune-and-deploy-wav2vec2-huggingface
View on GitHub
☆15Oct 8, 2023Updated 2 years ago
v-nhandt21 / Viphoneme
View on GitHub
Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA
☆108Jun 21, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
tarun-bisht / wav2vec2-asr
View on GitHub
wav2vec2 asr with transformers
☆16Oct 26, 2021Updated 4 years ago
aqtq314 / VogenSVS
View on GitHub
☆15Apr 16, 2026Updated 3 months ago
pquochuy / sasegan
View on GitHub
☆25Jul 20, 2021Updated 5 years ago
sarulab-speech / multi-speaker-dgp
View on GitHub
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Mar 23, 2021Updated 5 years ago
pyf98 / DPHuBERT
View on GitHub
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
☆118Jan 26, 2024Updated 2 years ago
sarulab-speech / whisper-asr-finetune
View on GitHub
☆32Dec 4, 2022Updated 3 years ago
v-nhandt21 / ViMFA
View on GitHub
Montreal Forced Aligner for Vietnamese
☆15Oct 23, 2023Updated 2 years ago