mozilla-ai / speech-to-text-finetune
Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language
☆38 Updated 2 months ago
Alternatives and similar repositories for speech-to-text-finetune
Users that are interested in speech-to-text-finetune are comparing it to the libraries listed below
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆47 Updated this week
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ … ☆45 Updated this week
- Joint speech-language model - respond directly to audio! ☆30 Updated last year
- Visualization and sparse autoencoder training for mechanistic interpretability on audio models ☆20 Updated 2 months ago
- ☆205 Updated last year
- Create an LJSpeech structured voice dataset on wave input ☆30 Updated 8 months ago
- ☆38 Updated 6 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆52 Updated 6 months ago
- ☆21 Updated last month
- Speaker Diarization with Transformers ☆67 Updated 2 weeks ago
- Blueprint by Mozilla.ai for answering questions about structured documents ☆34 Updated 3 months ago
- Collection of Open Source Speech Data ☆159 Updated 7 months ago
- ☆38 Updated last month
- A python package for whisper normalizer ☆60 Updated this week
- Use an appropriate mix of LLMs based on https://nuenki.app/blog research to translate languages better than any one tool. ☆22 Updated last month
- Blueprint to Build Your Own Timeline Algorithm ☆58 Updated 2 weeks ago
- ☆139 Updated 11 months ago
- kokoro text to speech using javascript ☆57 Updated 4 months ago
- Granite 3.1 Language Models ☆112 Updated 6 months ago
- Template that can be used to start your own Blueprint. ☆15 Updated 2 months ago
- ☆26 Updated 6 months ago
- Calling LLM APIs on a Raspberry Pi for lulz ☆24 Updated 2 years ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ☆190 Updated 3 months ago
- ☆13 Updated 3 months ago
- Shoonya - Platform to Annotate and label data at scale. ☆54 Updated 9 months ago
- ☆20 Updated last year
- Blindly query two conversational language models on tasks expressed in French and compare the results. ☆37 Updated this week
- Joint speech-language model - respond directly to audio! ☆369 Updated 11 months ago
- Minimal example of MCP for parsing llms.txt ☆38 Updated 2 months ago
- AI core services for Jitsi ☆57 Updated this week