Picovoice / orca
On-device streaming text-to-speech engine powered by deep learning
★120 · Updated last week
Alternatives and similar repositories for orca
Users that are interested in orca are comparing it to the libraries listed below
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ★159 · Updated last year
- A ggml (C++) re-implementation of tortoise-tts ★187 · Updated last year
- A random walk voice style cloning application for Kokoro text to speech ★128 · Updated 2 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ★205 · Updated 4 months ago
- ★154 · Updated last year
- Streaming and Fine-tuning for Chatterbox TTS ★178 · Updated 2 months ago
- ★99 · Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ★68 · Updated 2 months ago
- Joint speech-language model - respond directly to audio! ★371 · Updated last year
- G2P ★316 · Updated last month
- On-device speaker recognition engine powered by deep learning ★37 · Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ★97 · Updated last year
- On-device noise suppression powered by deep learning ★74 · Updated last month
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers ★57 · Updated 3 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ★196 · Updated 6 months ago
- ★40 · Updated this week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ★63 · Updated last year
- C++ library for converting text to phonemes for Piper ★134 · Updated 2 months ago
- TTS support with GGML ★175 · Updated 2 weeks ago
- Very fast, accurate speaker diarization ★87 · Updated this week
- ★236 · Updated 2 weeks ago
- faster-whisper as serverless endpoint ★117 · Updated 3 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere ★55 · Updated 3 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ★54 · Updated 9 months ago
- On-device voice activity detection (VAD) powered by deep learning ★228 · Updated last month
- whisper.cpp bindings for python ★102 · Updated 2 years ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ★42 · Updated this week
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ★115 · Updated 5 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ★184 · Updated 4 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ★99 · Updated 2 months ago