mobiusml / faster-whisper
Faster Whisper ASR transcription with CTranslate2
☆11Updated last week
Related projects: ⓘ
- Speaker diarization service☆17Updated last week
- Whisper_MCE☆13Updated 3 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆27Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 3 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- ☆22Updated 3 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 7 months ago
- My public domain speech index☆10Updated 5 years ago
- ☆19Updated 5 years ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆16Updated 4 years ago
- Supervoice diffusion enhance☆24Updated 2 months ago
- ☆9Updated 4 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- ☆11Updated 9 years ago
- ☆12Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆10Updated 3 weeks ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆16Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆16Updated 3 weeks ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆9Updated 2 months ago
- Self-contained Python package for OpenFst☆50Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- ☆17Updated last year
- Audio tokenization, in the fastest way possible!☆45Updated 3 weeks ago
- BurrMill core☆21Updated 2 years ago
- Evaluation of STT models for german language☆15Updated 2 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 2 years ago
- Agent toolkit for 100 hours of speech and 10 GiB of text☆13Updated 7 months ago