mozilla-ai / speech-to-text-finetune
Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language
☆36Updated 2 weeks ago
Alternatives and similar repositories for speech-to-text-finetune:
Users that are interested in speech-to-text-finetune are comparing it to the libraries listed below
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 4 months ago
- Minimal example of MCP for parsing llms.txt☆35Updated 2 weeks ago
- Joint speech-language model - respond directly to audio!☆30Updated 11 months ago
- ☆36Updated 4 months ago
- kokoro text to speech using javascript☆55Updated 2 months ago
- Visualization and sparse autoencoder training for mechanistic interpretability on audio models☆20Updated 3 weeks ago
- Small python package to measure OCR quality and other related metrics.☆21Updated last year
- ☆19Updated last week
- ☆123Updated 10 months ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆51Updated 3 weeks ago
- ☆204Updated 11 months ago
- Speaker Diarization with Transformers☆64Updated 11 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆47Updated this week
- Tools to make language models a bit easier to use☆43Updated last week
- Blueprint by Mozilla.ai for generating podcasts from documents using local AI☆90Updated this week
- image-to-text model for PDF.js☆36Updated last month
- ☆67Updated last year
- ☆30Updated 9 months ago
- Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs☆25Updated 3 months ago
- Collection of Open Source Speech Data☆153Updated 5 months ago
- Create an LJSpeech structured voice dataset on wave input☆28Updated 7 months ago
- In-browser LLM website generator☆49Updated 2 months ago
- A library for working with prompt templates locally or on the Hugging Face Hub.☆45Updated last month
- a simple system for 2-way interruptible voice interactions between human and LLM☆28Updated last year
- Arxflix turns your boring Arxiv research paper into a captivating video.☆47Updated 4 months ago
- Match your resume with a job, effortlessly☆20Updated this week
- ☆66Updated 11 months ago
- Template that can be used to start your own Blueprint.☆12Updated 2 weeks ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆187Updated 2 months ago
- lossily compress representation vectors using product quantization☆50Updated last week