Picovoice / orca
On-device streaming text-to-speech engine powered by deep learning
☆69 · Updated this week
Alternatives and similar repositories for orca:
Users who are interested in orca are comparing it to the libraries listed below.
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆150 · Updated 7 months ago
- On-device speaker recognition engine powered by deep learning ☆32 · Updated this week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆38 · Updated 2 months ago
- ☆94 · Updated 9 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆56 · Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆92 · Updated 9 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆87 · Updated 9 months ago
- G2P ☆107 · Updated this week
- Joint speech-language model - respond directly to audio! ☆30 · Updated 9 months ago
- Joint speech-language model - respond directly to audio! ☆364 · Updated 7 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆118 · Updated last week
- ☆254 · Updated 11 months ago
- On-device speaker diarization powered by deep learning ☆36 · Updated this week
- fast state-of-the-art speech models and a runtime that runs anywhere ☆54 · Updated last week
- Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning ☆25 · Updated this week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆49 · Updated 2 months ago
- whisper.cpp bindings for python ☆86 · Updated last year
- Collection of Open Source Speech Data ☆151 · Updated 3 months ago
- RunPod worker of the faster-whisper model for Serverless Endpoint. ☆83 · Updated last week
- Create an LJSpeech structured voice dataset on wave input ☆25 · Updated 4 months ago
- ☆91 · Updated 3 weeks ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient… ☆43 · Updated this week
- ☆110 · Updated 7 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆60 · Updated last year
- ☆153 · Updated last year
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M. ☆44 · Updated last month
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆320 · Updated this week
- A ggml (C++) re-implementation of tortoise-tts ☆176 · Updated 5 months ago
- Speaker diarization service ☆21 · Updated last month
- Google's SoundStorm: Efficient Parallel Audio Generation ☆131 · Updated last year