mozilla-ai / speech-to-text-finetune
Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language
☆36 Updated last month
Alternatives and similar repositories for speech-to-text-finetune
Users who are interested in speech-to-text-finetune are comparing it to the repositories listed below
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆52 Updated 5 months ago
- ☆20 Updated last week
- Joint speech-language model - respond directly to audio! ☆30 Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated last month
- Collection of Open Source Speech Data ☆157 Updated 6 months ago
- ☆30 Updated 2 weeks ago
- Small python package to measure OCR quality and other related metrics. ☆21 Updated last year
- Open-source and reproducible benchmarks for Speaker Diarization ☆24 Updated last month
- Blueprint by Mozilla.ai for generating podcasts from documents using local AI ☆92 Updated this week
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you? ☆22 Updated 10 months ago
- 🫠 check your data, before you wreck your model ☆16 Updated 2 years ago
- ☆36 Updated 5 months ago
- Template that can be used to start your own Blueprint. ☆14 Updated last month
- Speaker Diarization with Transformers ☆64 Updated 11 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ☆189 Updated 2 months ago
- Create an LJSpeech structured voice dataset on wave input ☆30 Updated 7 months ago
- Minimal example of MCP for parsing llms.txt ☆38 Updated last month
- On-device streaming text-to-speech engine powered by deep learning ☆79 Updated last week
- kokoro text to speech using javascript ☆57 Updated 3 months ago
- ☆26 Updated 5 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆47 Updated this week
- ☆204 Updated 11 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆26 Updated 9 months ago
- ☆155 Updated last year
- A python package for whisper normalizer ☆59 Updated last week
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4 ☆21 Updated last year
- ☆30 Updated 10 months ago
- image-to-text model for PDF.js ☆36 Updated 2 months ago
- Audio tokenization, in the fastest way possible! ☆52 Updated 8 months ago
- ☆124 Updated 10 months ago