egorsmkv / whisper-ukrainian

Trainer and Evaluation scripts for fine-tuning Whisper models for the Ukrainian language

☆15

Related projects

Alternatives and complementary repositories for whisper-ukrainian

rhasspy / piper-phonemize
C++ library for converting text to phonemes for Piper
☆89 Updated 8 months ago
alphacep / vosk-tts
Text To Speech Synthesis with Vosk
☆128 Updated 3 months ago
Mastering-Python-GT / Transcription-diarization-whisper-pyannote
Transcription and diarization (speaker identification)
☆28 Updated last year
cvqluu / simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
☆140 Updated 6 months ago
vasistalodagala / whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
☆251 Updated last year
hanifabd / voice-activity-detection-vad-realtime
Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription
☆54 Updated 5 months ago
roatienza / efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.
☆156 Updated 7 months ago
justinwlin / runpodWhisperx
Runpod WhisperX Docker Container Repo
☆11 Updated 8 months ago
appvoid / vosper
Real-Time Whisper Voice Recognition with vosk model feedback.
☆105 Updated last year
awexandrr / audioWhisper
Listen to any audio stream on your machine and print out the transcribed or translated audio.
☆114 Updated last year
davidmartinrius / speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆204 Updated 5 months ago
sidhantls / lexpod-speaker-prediction
Speaker prediction for captions on the Lex Fridman podcast
☆23 Updated 8 months ago
Picovoice / koala
On-device noise suppression powered by deep learning
☆62 Updated last month
rendchevi / nix-tts
🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation
☆239 Updated last year
Den4ikAI / ruphon
Simple IPA phonemizer based on ruaccent-encoder
☆14 Updated 3 weeks ago
hedrergudene / asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆84 Updated 6 months ago
souvikg544 / TTS_Data_Maker
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…
☆27 Updated last year
jumon / zac
Zero-shot Audio Classification using Whisper
☆74 Updated last year
FENRlR / MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
☆114 Updated this week
miguelvalente / whisperer
Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.
☆133 Updated last year
prateekralhan / OpenAI_Whisper_ASR
A minimalistic automatic speech recognition web app built with Streamlit and powered by OpenAI's Whisper "State of the Art" models
☆65 Updated 2 years ago
HKAB / whisper-finetune-vietnamese
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
☆33 Updated last year
coqui-ai / xtts-streaming-server
☆295 Updated 4 months ago
nalbion / whisper-server
Streaming speech-to-text server using Whisper
☆83 Updated last year
ccoreilly / wav2vec2-service
☆38 Updated 2 years ago
jasonppy / PromptingWhisper
Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
☆132 Updated 9 months ago
yl4579 / StyleTTS
Official Implementation of StyleTTS
☆398 Updated 11 months ago
MartinMashalov / VoiceCloning
Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the…
☆46 Updated last year
ZDisket / TensorVox
Desktop application for neural speech synthesis written in C++
☆210 Updated last year
dbklim / RNNoise_Wrapper
A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…
☆141 Updated 5 months ago