voithru/wav2vec2_finetune

Wav2Vec2 fine-tuning and inference code for the IITP AI Grand Challenge

☆36

Alternatives and similar repositories for wav2vec2_finetune

Users who are interested in wav2vec2_finetune are comparing it to the repositories listed below.

voithru / asr-text_classification-pipeline
☆21 · Feb 21, 2022 · Updated 4 years ago
voithru / voice-activity-detection
PyTorch implementation of Self-Attentive VAD, ICASSP 2021
☆159 · Oct 26, 2021 · Updated 4 years ago
baikalai / baikal-bert
baikal.ai's pre-trained BERT models: descriptions and sample code
☆12 · Jun 24, 2021 · Updated 5 years ago
snunlp / KR-ELECTRA
KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch
☆15 · Feb 13, 2022 · Updated 4 years ago
qinyuenlp / wav2vec_finetune
ASR: fine-tune wav2vec 2.0 with transformers
☆21 · Sep 13, 2021 · Updated 4 years ago
ko-nlp / moducorpus-sanitizer
Provides functionality for converting Modu Corpus (모두의 말뭉치) data into a form convenient for analysis.
☆11 · Mar 2, 2022 · Updated 4 years ago
LG-AI-EXAONE / KMMLU-Pro
☆16 · Aug 18, 2025 · Updated 11 months ago
noowad93 / chosung-translator
Chosung (Korean initial consonant) interpreter based on ko-BART
☆29 · Mar 31, 2021 · Updated 5 years ago
phillip0726 / NaverBlog-Twitter-Youtube-crawling
We can crawl NaverBlog, Twitter, Youtube!!
☆13 · Sep 13, 2019 · Updated 6 years ago
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆50 · May 19, 2021 · Updated 5 years ago
daanzu / wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23 · Aug 16, 2021 · Updated 4 years ago
sooftware / speech-transformer
Transformer implementation specialized in speech recognition tasks using PyTorch.
☆65 · Nov 28, 2021 · Updated 4 years ago
scarletcho / KoLM
Korean text normalization and language preparation package for LM in Kaldi-based ASR system
☆64 · Apr 23, 2020 · Updated 6 years ago
patrickvonplaten / Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆110 · Aug 31, 2022 · Updated 3 years ago
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆38 · Feb 27, 2022 · Updated 4 years ago
songys / 2021Langcon
☆11 · Oct 3, 2021 · Updated 4 years ago
sooftware / speech-paper-review
Review of papers I read
☆14 · Dec 11, 2020 · Updated 5 years ago
deepaudio / deepaudio-speaker
Neural-network-based speaker embedder
☆24 · Jan 7, 2023 · Updated 3 years ago
sooftware / luna-transformer
A PyTorch implementation of Luna: Linear Unified Nested Attention
☆41 · Jul 29, 2021 · Updated 5 years ago
JoungheeKim / kor-spacing
A project for Korean automatic spacing
☆12 · Aug 3, 2020 · Updated 5 years ago
upskyy / Paper-Review
Paper Review about Speech Recognition · NLP
☆10 · Mar 25, 2021 · Updated 5 years ago
tunib-ai / joker
AI model designed to test effectiveness in handling external ethical attacks.
☆11 · Feb 9, 2026 · Updated 5 months ago
sooftware / deepspeech2
PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)
☆29 · Mar 5, 2021 · Updated 5 years ago
gauthamsuresh09 / wav2vec2-large-xlsr-53-malayalam
Wav2vec2 Large XLSR 53 fine-tuned for Malayalam
☆11 · Sep 7, 2021 · Updated 4 years ago
detail-novelist / novelist-triton-server
Deploy KoGPT with Triton Inference Server
☆14 · Nov 18, 2022 · Updated 3 years ago
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33 · Aug 19, 2021 · Updated 4 years ago
sooftware / nlp-tasks
Natural Language Processing Tasks and Examples.
☆62 · Aug 17, 2022 · Updated 3 years ago
upskyy / Automatic-Speech-Recognition-Models
End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
☆10 · Jan 21, 2022 · Updated 4 years ago
jason9693 / FROZEN
☆14 · May 3, 2022 · Updated 4 years ago
SMART-TTS / SMART-G2P
☆103 · Mar 24, 2023 · Updated 3 years ago
sooftware / End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
☆38 · Jan 10, 2021 · Updated 5 years ago
songys / AwesomeKorean_Speech
Speech recognition and signal processing
☆14 · Sep 12, 2021 · Updated 4 years ago
KrishnaDN / BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17 · Dec 10, 2020 · Updated 5 years ago
hchung12 / espnet-asr
☆37 · Dec 23, 2020 · Updated 5 years ago
tunib-ai / transformers
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
☆31 · Feb 5, 2022 · Updated 4 years ago
sooftware / jasper
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
☆32 · Mar 4, 2021 · Updated 5 years ago
Xilinx / libdfx
☆13 · Jun 14, 2026 · Updated last month
nc-ai / speech
☆17 · Aug 27, 2025 · Updated 11 months ago
gururise / openai_text_generation_inference_server
Use OpenAI with HuggingChat by emulating the text_generation_inference_server
☆44 · Jun 25, 2023 · Updated 3 years ago