Picovoice / orca
On-device streaming text-to-speech engine powered by deep learning
☆73 · Updated this week
Alternatives and similar repositories for orca:
Users who are interested in orca are comparing it to the libraries listed below.
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆154 · Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 · Updated 10 months ago
- On-device speaker recognition engine powered by deep learning ☆32 · Updated this week
- Efficient approach to speaker diarization using voice characteristics extraction ☆92 · Updated 10 months ago
- A ggml (C++) re-implementation of tortoise-tts ☆178 · Updated 7 months ago
- On-device speaker diarization powered by deep learning ☆39 · Updated this week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆45 · Updated last month
- ☆207 · Updated 5 months ago
- RunPod worker of the faster-whisper model for Serverless Endpoint. ☆89 · Updated last month
- G2P ☆171 · Updated this week
- ☆95 · Updated 10 months ago
- ☆91 · Updated 2 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆60 · Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation ☆113 · Updated last year
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆56 · Updated last week
- ☆318 · Updated 8 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆51 · Updated 3 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3! ☆252 · Updated 4 months ago
- Real-time Voice Activity Detection (VAD) with some example use cases, like a simple voice bot and live transcription (realtime transcription) ☆71 · Updated 9 months ago
- Joint speech-language model - respond directly to audio! ☆30 · Updated 10 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆32 · Updated 2 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆60 · Updated last week
- On-device voice activity detection (VAD) powered by deep learning ☆202 · Updated this week
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output. ☆26 · Updated 9 months ago
- Collection of Open Source Speech Data ☆152 · Updated 4 months ago
- Running the F5-TTS by ONNX Runtime ☆123 · Updated this week
- Create an LJSpeech structured voice dataset on wave input ☆26 · Updated 5 months ago
- ☆155 · Updated last year