Picovoice / orca
On-device streaming text-to-speech engine powered by deep learning
★76 · Updated this week
Alternatives and similar repositories for orca:
Users who are interested in orca are comparing it to the libraries listed below.
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ★158 · Updated 9 months ago
- ★95 · Updated 11 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ★94 · Updated 11 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ★92 · Updated 11 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ★48 · Updated last week
- On-device speaker diarization powered by deep learning ★43 · Updated last month
- Real-time Speech-Text Foundation Model Toolkit (wip) ★224 · Updated 3 weeks ago
- ★217 · Updated 3 weeks ago
- On-device speaker recognition engine powered by deep learning ★34 · Updated this week
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ★34 · Updated this week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ★53 · Updated 4 months ago
- G2P ★210 · Updated last week
- faster-whisper as serverless endpoint ★95 · Updated this week
- ★156 · Updated last year
- whisper.cpp bindings for python ★94 · Updated last year
- ★325 · Updated 9 months ago
- Faster Tortoise inference than Tortoise Fast Fork ★128 · Updated last year
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers ★40 · Updated last week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ★174 · Updated 6 months ago
- Joint speech-language model - respond directly to audio! ★30 · Updated 11 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ★62 · Updated last week
- ★255 · Updated last year
- ★355 · Updated 7 months ago
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. ★243 · Updated 10 months ago
- Create an LJSpeech structured voice dataset on wave input ★27 · Updated 6 months ago
- Open TTS models, built for streaming on the edge ★39 · Updated last month
- A ggml (C++) re-implementation of tortoise-tts ★178 · Updated 8 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ★103 · Updated last week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ★62 · Updated last year
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ★62 · Updated 3 weeks ago