bayartsogt-ya/whisper-multiple-hf-datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bayartsogt-ya/whisper-multiple-hf-datasets)

bayartsogt-ya / whisper-multiple-hf-datasets

Whisper fine-tuning event script to use multiple hf datasets

☆32

Alternatives and similar repositories for whisper-multiple-hf-datasets

Users that are interested in whisper-multiple-hf-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ProjectEGU / whisper-for-low-vram
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆29Dec 16, 2023Updated 2 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
AsoSoft / AsoSoft-Speech-Corpus
View on GitHub
AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…
☆10Mar 8, 2022Updated 4 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 5 months ago
sarulab-speech / whisper-asr-finetune
View on GitHub
☆32Dec 4, 2022Updated 3 years ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
titu1994 / warprnnt_numba
View on GitHub
WarpRNNT loss ported in Numba CPU/CUDA for Pytorch
☆17Mar 11, 2022Updated 4 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
MTG / PodcastMix-inference
View on GitHub
☆32Jan 6, 2022Updated 4 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
Archivoice / nnsvs-chinese-support
View on GitHub
Hed and supporting files for Chinese NNSVS Dataset Creation
☆13Oct 14, 2025Updated 9 months ago
jumon / whisper-finetuning
View on GitHub
[WIP] Scripts for fine-tuning Whisper
☆221Jul 2, 2026Updated 3 weeks ago
voidful / wav2vec2-xlsr-multilingual-56
View on GitHub
56 language, 1 model Multilingual ASR
☆25Jul 25, 2021Updated 5 years ago
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
xinjli / phonepiece
View on GitHub
phone inventory library
☆17May 15, 2023Updated 3 years ago
bagustris / w2v2-vad
View on GitHub
A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition
☆22Aug 9, 2023Updated 2 years ago
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
idiap / w2v2-air-traffic
View on GitHub
This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
☆42Jul 10, 2024Updated 2 years ago
Open-Speech-EkStep / audio-to-speech-pipeline
View on GitHub
This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline
☆33Feb 15, 2023Updated 3 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
VKW2021 / kaldi-baseline
View on GitHub
kaldi cnn-tdnnf baseline
☆13Aug 31, 2021Updated 4 years ago
falabrasil / gitlab-resources
View on GitHub
This is a legacy repo. Dev occurs now on GitHub.
☆11Mar 28, 2021Updated 5 years ago
MrEdwards007 / WhisperTaskAcceleration
View on GitHub
Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization
☆25Oct 29, 2022Updated 3 years ago
jindongwang / EasyEspnet
View on GitHub
Making Espnet easier to use
☆54Apr 9, 2021Updated 5 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 5 years ago
RazhanHameed / kurdish-llama
View on GitHub
This is an attempt to fine-tune the Llama model for Central Kurdish.
☆17May 24, 2023Updated 3 years ago
uniglot / korean-word-ipa-dictionary
View on GitHub
Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)
☆23Nov 12, 2025Updated 8 months ago
RemiRigal / snreval-python
View on GitHub
This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…
☆12Jun 22, 2022Updated 4 years ago
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
AsoSoft / AsoSoft-TTS-Speech-Corpus-for-Central-Kurdish
View on GitHub
AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech
☆23Jun 24, 2022Updated 4 years ago
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago