sovse/base_rus_whisper_stt

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sovse/base_rus_whisper_stt)

sovse / base_rus_whisper_stt

Fine tuning of the base model from OpenAI Whisper in Russian language on the dataset Sber-golos

☆39

Alternatives and similar repositories for base_rus_whisper_stt

Users that are interested in base_rus_whisper_stt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sovse / Rus-SpeechRecognition-LSTM-CTC-VoxForge
View on GitHub
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
☆58Sep 16, 2022Updated 3 years ago
alphacep / awesome-russian-speech
View on GitHub
Russian speech technology links
☆405Mar 17, 2026Updated 4 months ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
Raumberg / myllm
View on GitHub
Multi-node distributed LLM training framework
☆17Sep 5, 2025Updated 10 months ago
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
ProjectEGU / whisper-for-low-vram
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆29Dec 16, 2023Updated 2 years ago
sushant-t / tts-trainer
View on GitHub
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆30May 27, 2023Updated 3 years ago
milenagazdieva / LightUnbalancedOptimalTransport
View on GitHub
PyTorch implementation of "Light Unbalanced Optimal Transport" (NeurIPS 2024)
☆22Dec 23, 2024Updated last year
bayartsogt-ya / whisper-multiple-hf-datasets
View on GitHub
Whisper fine-tuning event script to use multiple hf datasets
☆32Dec 20, 2022Updated 3 years ago
bookbot-hive / k2-indonesian-asr
View on GitHub
Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).
☆16Jun 30, 2023Updated 3 years ago
ainy / shershe
View on GitHub
Speech recognition dataset based on russian audiobook, sentance-level split
☆18Oct 6, 2018Updated 7 years ago
AIRI-Institute / AI4TALK
View on GitHub
☆13Dec 7, 2022Updated 3 years ago
nolanritchie / Super-Survivors
View on GitHub
NPC Mod for Project Zomboid
☆16Aug 14, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
alexisdmacintyre / SpeechBreathingToolbox
View on GitHub
Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.
☆11Feb 17, 2024Updated 2 years ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
mbebenita / j2me.js
View on GitHub
J2ME VM in JavaScript
☆10Sep 23, 2015Updated 10 years ago
pytorch-lifestream / ptls-experiments
View on GitHub
Experiments on public datasets for pytorch-lifestream library
☆20Nov 20, 2024Updated last year
LingweiMeng / Whisper-Sidecar
View on GitHub
The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".
☆34Aug 2, 2025Updated 11 months ago
vadimtimakin / end2end-HKR-research
View on GitHub
The solution and code for NTO AI Olympics 2022.
☆19Sep 20, 2022Updated 3 years ago
ekapolc / gowajee_corpus
View on GitHub
Thai smart home corpus with "Gowajee" hotword
☆19Jul 30, 2023Updated 2 years ago
wuyushuwys / FMEDiffusion
View on GitHub
[NeurIPS2024] Fast and Memory-Efficient Video Diffusion Using Streamlined Inference
☆18Dec 3, 2024Updated last year
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Kutuz4 / ImplicitlyNormalizedForecasterWithClipping
View on GitHub
☆20Nov 26, 2023Updated 2 years ago
GameDevEducation / UnityTutorial_SimsStyleAI
View on GitHub
☆16Jun 19, 2023Updated 3 years ago
DeevsDeevs / TenderHack
View on GitHub
Development of a prototype engine for searching for goods on the tender procurement portal
☆27Oct 25, 2022Updated 3 years ago
mush42 / mantoq
View on GitHub
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆16Mar 15, 2025Updated last year
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
ivkireev86 / datafusion-contest-2022
View on GitHub
☆25Oct 6, 2022Updated 3 years ago
isadrtdinov / bootcamp-idao-2022
View on GitHub
IDAO 2022: Machine Learning Bootcamp
☆19Dec 4, 2021Updated 4 years ago
viktor-z / fb2pdf
View on GitHub
fb2 to PDF converter
☆10Apr 17, 2026Updated 3 months ago
salute-developers / golos
View on GitHub
☆148May 21, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
marxoft / musikloud2
View on GitHub
A SoundCloud client and music player that can be extended via plugins.
☆11Jan 29, 2017Updated 9 years ago
AI4Bharat / IndicVoices
View on GitHub
☆19Feb 22, 2026Updated 5 months ago
corycorvus / Unity-Speech-to-Text
View on GitHub
his plugin interfaces Windows streaming, Wit.ai non-streaming, Google streaming/non-streaming, and IBM Watson streaming/non-streaming spe…
☆26Oct 10, 2017Updated 8 years ago
shun60s / Vocal-Tube-Model
View on GitHub
a very simple vocal tract model, few tube model. generate vowel sound by it
☆18Jun 27, 2026Updated last month
laboroai / TEDxJP-10K
View on GitHub
☆26Jan 14, 2021Updated 5 years ago
FloaterTS / RTSUnityGameLicenta
View on GitHub
Unity 3D RTS Game
☆14Jun 30, 2021Updated 5 years ago
Malkovsky / interactive-visualization
View on GitHub
☆30May 5, 2024Updated 2 years ago