mozilla-ai / speech-to-text-finetune
Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language
☆39 Updated 3 months ago
Alternatives and similar repositories for speech-to-text-finetune
Users who are interested in speech-to-text-finetune are comparing it to the libraries listed below.
- ☆146 Updated last year
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ☆194 Updated 4 months ago
- ☆158 Updated 2 years ago
- Joint speech-language model - respond directly to audio! ☆30 Updated last year
- Open Audio Watermarking Tool ☆223 Updated 2 weeks ago
- On-device streaming text-to-speech engine powered by deep learning ☆98 Updated this week
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆128 Updated 8 months ago
- ☆300 Updated last year
- Real-Time Whisper Voice Recognition with Vosk model feedback. ☆117 Updated 2 years ago
- Speaker Diarization with Transformers ☆68 Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated last year
- ☆205 Updated last year
- Streaming speech-to-text server using Whisper ☆93 Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆28 Updated 11 months ago
- Source code for Mozilla.ai's Lumigator platform ☆237 Updated last week
- ☆21 Updated 2 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ☆503 Updated last year
- ☆359 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆96 Updated last year
- whisper.cpp bindings for Python ☆98 Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆214 Updated 8 months ago
- ☆38 Updated 7 months ago
- A simple, consistent and extendable toolkit for IndicTrans2 ☆34 Updated last month
- Crowd-sourced lists of URLs to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ … ☆46 Updated this week
- 🐸STT integration examples ☆129 Updated 2 years ago
- An automatic speech recognition API ☆63 Updated this week
- Create an LJSpeech structured voice dataset on wave input ☆32 Updated 9 months ago
- Joint speech-language model - respond directly to audio! ☆371 Updated last year
- An even smaller speech recognizer / force aligner ☆34 Updated 7 months ago