qinyuenlp/wav2vec_finetune

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qinyuenlp/wav2vec_finetune)

qinyuenlp / wav2vec_finetune

ASR: fine-tune wav2vec 2.0 with transformers

☆21

Alternatives and similar repositories for wav2vec_finetune

Users that are interested in wav2vec_finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vistec-AI / wav2vec2-large-xlsr-53-th
View on GitHub
Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0
☆53Apr 23, 2022Updated 4 years ago
chutaklee / CantoASR
View on GitHub
Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)
☆16May 8, 2022Updated 4 years ago
hyyoka / Acoustic-Features
View on GitHub
audio/speech feature extraction using parselmouth, librosa, disvoice
☆10Jan 28, 2022Updated 4 years ago
yazone / BasedRuleQA_Parser
View on GitHub
基于规则匹配的问答系统中的解析器，the parser of based rule QA system
☆12Mar 13, 2020Updated 6 years ago
du-ud / kaldi-cslt
View on GitHub
☆15Aug 30, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jiay7 / wenet_onlinedecode
View on GitHub
Went online decode demo
☆31Apr 28, 2021Updated 5 years ago
SilvrDuck / AccentedSpeechRecognition
View on GitHub
Experiments on speech recognition robustness to accents and dialects
☆12Apr 2, 2019Updated 7 years ago
voithru / wav2vec2_finetune
View on GitHub
Wav2Vec2 finetune and inference code for IITP AI Grand Challenge
☆36Feb 22, 2022Updated 4 years ago
erasedwalt / CTC-ASR
View on GitHub
An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models
☆12Nov 13, 2021Updated 4 years ago
StephenLee2016 / simple-chatbot
View on GitHub
基于规则和相似匹配的闲聊机器人
☆13Nov 8, 2017Updated 8 years ago
jinsongpan / ASR_Course_Homework
View on GitHub
分享在深蓝学院《语音识别：从入门到精通》第一期课程学习过程中完成的课后作业，供参考。
☆21Sep 13, 2020Updated 5 years ago
mailong25 / self-supervised-speech-recognition
View on GitHub
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380Nov 22, 2021Updated 4 years ago
sarulab-speech / whisper-asr-finetune
View on GitHub
☆32Dec 4, 2022Updated 3 years ago
hchung12 / espnet-asr
View on GitHub
☆37Dec 23, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sbl / chimera
View on GitHub
Auditory chimera
☆13Nov 12, 2017Updated 8 years ago
HuangZiliAndy / SSL_for_multitalker
View on GitHub
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆33Mar 16, 2023Updated 3 years ago
ixuejiaozhao / SAN-for-Product-Attributes-Prediction
View on GitHub
☆13Sep 23, 2025Updated 10 months ago
Aisaka0v0 / TS-Whisper
View on GitHub
☆33Jun 12, 2025Updated last year
hfutami / distill-bert-for-seq2seq-asr
View on GitHub
☆24Jun 17, 2020Updated 6 years ago
ZQuang2202 / Zipformer_Lightning
View on GitHub
An upgrade framework for train and validate compare with icefall using Lightning.
☆16Mar 26, 2025Updated last year
haotangxjtu / MSCL
View on GitHub
code for Multisample-based Contrastive Loss for Top-k Recommendation (IEEE TMM)
☆10Nov 23, 2022Updated 3 years ago
kehanlu / Mandarin-Wav2Vec2
View on GitHub
Pre-trained Wav2vec2.0 for Mandarin
☆43Oct 30, 2022Updated 3 years ago
jonatasgrosman / wav2vec2-sprint
View on GitHub
☆206Feb 22, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
georgid / AlignmentEvaluation
View on GitHub
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…
☆18Oct 27, 2020Updated 5 years ago
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
rakshithShetty / dnn-speech
View on GitHub
This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition
☆12Dec 8, 2015Updated 10 years ago
ogunlao / glowtts_stdp
View on GitHub
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆19Jun 5, 2023Updated 3 years ago
TWOEARS / auditory-front-end
View on GitHub
Two!Ears Auditory Model - Auditory front-end module
☆16Jan 24, 2018Updated 8 years ago
rawbeen248 / audio_classification_finetuning
View on GitHub
This project focuses on the classification of animal sounds using deep learning. The core idea is to utilize audio processing techniques …
☆10Dec 3, 2024Updated last year
Ruiqi-Yan / URO-Bench
View on GitHub
Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models
☆55Sep 2, 2025Updated 10 months ago
YihongSun / MOD-UV
View on GitHub
[ECCV24] MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos
☆11Oct 7, 2024Updated last year
xiangyida / TicTacToe
View on GitHub
基于UDP通信的联机对战，基于JAVAGUI,适合java初学者练习的一款小游戏
☆23Feb 17, 2019Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
xinjli / asr2k
View on GitHub
asr2k
☆51Jun 2, 2024Updated 2 years ago
George0828Zhang / simulst
View on GitHub
PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.
☆25Oct 3, 2022Updated 3 years ago
jindongwang / EasyEspnet
View on GitHub
Making Espnet easier to use
☆54Apr 9, 2021Updated 5 years ago
zhangjiatao / Hospital-Report-Demo
View on GitHub
医院体检报告信息抽取及模板生成
☆12Apr 25, 2019Updated 7 years ago
TencentGameMate / chinese_speech_pretrain
View on GitHub
chinese speech pretrained models
☆1,211Aug 23, 2024Updated last year
wyl7 / ClusterSCL
View on GitHub
The pytorch implementation of ClusterSCL (WWW2022).
☆15Apr 20, 2023Updated 3 years ago
tongjinle123 / speech-transformer-pytorch_lightning
View on GitHub
ASR project with pytorch-lightning
☆20Mar 21, 2025Updated last year