mozilla-ai / speech-to-text-finetune
Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language
☆35 Updated last week
Alternatives and similar repositories for speech-to-text-finetune:
Users who are interested in speech-to-text-finetune are comparing it to the libraries listed below:
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated 3 months ago
- Joint speech-language model - respond directly to audio! ☆30 Updated 10 months ago
- ☆201 Updated 10 months ago
- ☆117 Updated 9 months ago
- AI core services for Jitsi ☆54 Updated this week
- Small Python package to measure OCR quality and other related metrics. ☆21 Updated last year
- Speaker Diarization with Transformers ☆64 Updated 10 months ago
- On-device streaming text-to-speech engine powered by deep learning ☆73 Updated this week
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆47 Updated 8 months ago
- ☆67 Updated last year
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆60 Updated this week
- Video+code lecture on building nanoGPT from scratch ☆66 Updated 9 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆47 Updated 3 weeks ago
- Using modal.com to process FineWeb-edu data ☆20 Updated 3 weeks ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs ☆43 Updated last year
- In-browser LLM website generator ☆48 Updated 2 months ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc… ☆10 Updated 3 weeks ago
- Streaming speech-to-text server using Whisper ☆89 Updated last year
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs … ☆50 Updated last month
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆87 Updated 3 months ago
- ☆66 Updated 10 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆61 Updated 3 weeks ago
- Joint speech-language model - respond directly to audio! ☆369 Updated 9 months ago
- Speaker diarization service ☆21 Updated last month
- Collection of Open Source Speech Data ☆152 Updated 4 months ago
- Uses ChatGPT to auto create a course that renders in liascript (more formats coming soon) ☆31 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 10 months ago
- Blueprint by Mozilla.ai for generating podcasts from documents using local AI ☆81 Updated last week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆66 Updated this week
- ☆18 Updated last year