pashanitw/W2V2-BERT-ASR-Training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pashanitw/W2V2-BERT-ASR-Training)

pashanitw / W2V2-BERT-ASR-Training

☆15

Alternatives and similar repositories for W2V2-BERT-ASR-Training

Users that are interested in W2V2-BERT-ASR-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

baikalai / baikal-bert
View on GitHub
baikal.ai's pre-trained BERT models: descriptions and sample codes
☆12Jun 24, 2021Updated 5 years ago
Sreyan88 / Disfluency-Detection-with-Span-Classification
View on GitHub
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆14Jun 6, 2023Updated 3 years ago
yichen14 / FastAdaSP
View on GitHub
Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)
☆17Nov 14, 2024Updated last year
rithiksachdev / PostASR-Correction-SLT2024
View on GitHub
☆18Jul 22, 2024Updated 2 years ago
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Hypotheses-Paradise / UADF
View on GitHub
☆17May 5, 2024Updated 2 years ago
YuanGongND / llm_speech_emotion_challenge
View on GitHub
☆23Jun 24, 2024Updated 2 years ago
utter-project / mHuBERT-147-scripts
View on GitHub
Collection of scripts from mHuBERT-147.
☆35Nov 19, 2024Updated last year
bostxavier / Serial-Speakers
View on GitHub
Companion toolkit of the 'Serial Speakers' dataset.
☆11Feb 17, 2020Updated 6 years ago
the-bird-F / GLM-Voice-RAG
View on GitHub
[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…
☆31Jul 11, 2025Updated last year
warisqr007 / ppg2ppg
View on GitHub
Zero-Shot Foreign Accent Conversion without a Native Reference
☆36May 1, 2024Updated 2 years ago
naver-ai / RapFlow-TTS
View on GitHub
☆56Jul 16, 2025Updated last year
orcaman / ai-jumpstart
View on GitHub
☆13Nov 11, 2023Updated 2 years ago
KunHanKH / GE2E_Speaker_Verification
View on GitHub
Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"
☆10Mar 11, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Melvin-Var / ConjointElectricVehicles
View on GitHub
Dashboard showcasing Conjoint Analysis for the Electric Vehicle Lease Market (as at January 2020) in San Francisco
☆15Feb 19, 2020Updated 6 years ago
sooftware / taKotron2
View on GitHub
Tacotron2 for Korean (taKotron2)
☆34Apr 8, 2022Updated 4 years ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
clovaai / aqm-plus
View on GitHub
PyTorch code for Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation (AQM+)
☆10Feb 12, 2019Updated 7 years ago
NVIDIA / RAD-MMM
View on GitHub
A TTS model that makes a speaker speak new languages
☆76Jun 18, 2024Updated 2 years ago
khanld / Wav2vec2-Pretraining
View on GitHub
Wav2vec 2.0 Self-Supervised Pretraining
☆61Feb 6, 2025Updated last year
aispeech-lab / w2v-cif-bert
View on GitHub
☆37Jun 28, 2021Updated 5 years ago
jim-meyer / lottery_ticket_pruner
View on GitHub
(Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning
☆10Dec 8, 2022Updated 3 years ago
ddofer / talk
View on GitHub
Slides and talks from presentations, workshops, etc'
☆19Feb 1, 2026Updated 5 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
biboamy / FBA-Fall19
View on GitHub
☆10Mar 10, 2021Updated 5 years ago
kamperh / globalphone_awe
View on GitHub
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11Nov 3, 2020Updated 5 years ago
OscarVanL / LibriTTS-British-Accents
View on GitHub
A subset of the popular LibriTTS dataset with subsets for English, Scottish, Welsh, and Irish accents.
☆16Mar 17, 2023Updated 3 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
jfainberg / sincnet_adapt
View on GitHub
Raw waveform adaptation with SincNet
☆12Mar 19, 2024Updated 2 years ago
boostcampaitech3 / final-project-level3-cv-17
View on GitHub
[2022.05.16 ~ 2022.06.10] 🌤️미세먼지 없는 맑은 사진📷 - 부스트캠프 AI Tech 3기 최종 프로젝트
☆14Jun 11, 2022Updated 4 years ago
rdisipio / qnlp
View on GitHub
NLP stuff with quantum computing
☆17Nov 9, 2020Updated 5 years ago
kyegomez / USM
View on GitHub
Implementation of Google's USM speech model in Pytorch
☆36Updated this week
nafiuny / ICRCycleGAN-VC
View on GitHub
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
☆15Apr 15, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
facebookresearch / llama-hd-dataset
View on GitHub
This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.
☆22Jan 22, 2024Updated 2 years ago
HLasse / multidiagnosis-speech
View on GitHub
☆10Jun 23, 2023Updated 3 years ago
azmat21 / UyghurTextResource
View on GitHub
uyghur text resource crawled from website
☆12Dec 25, 2015Updated 10 years ago
zqs01 / data2vecnoisy
View on GitHub
☆11Oct 20, 2022Updated 3 years ago
slSeanWU / beats-conformer-bart-audio-captioner
View on GitHub
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41Jan 6, 2024Updated 2 years ago
tenebo / g2pk2
View on GitHub
Updated folk of g2pk
☆13Aug 18, 2023Updated 2 years ago
tqjxlm / Simple-DQN-Pytorch
View on GitHub
A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input
☆13Feb 28, 2019Updated 7 years ago