Picovoice / orcaLinks
On-device streaming text-to-speech engine powered by deep learning
☆120Updated 3 weeks ago
Alternatives and similar repositories for orca
Users that are interested in orca are comparing it to the libraries listed below
Sorting:
- A random walk voice style cloning application for Kokoro text to speech☆141Updated 3 months ago
- Very fast, accurate speaker diarization☆129Updated this week
- ☆155Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆212Updated 5 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆190Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆157Updated last year
- Open Audio Watermarking Tool☆292Updated 3 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆69Updated 2 months ago
- ☆100Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆190Updated last year
- On-device noise suppression powered by deep learning☆75Updated last month
- ☆42Updated this week
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 4 months ago
- ☆253Updated last month
- G2P☆321Updated last month
- Joint speech-language model - respond directly to audio!☆372Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 9 months ago
- On-device speaker recognition engine powered by deep learning☆37Updated last month
- TTS support with GGML☆179Updated last month
- Open models for Coqui STT☆144Updated 2 years ago
- faster-whisper as serverless endpoint☆118Updated 4 months ago
- On-device voice activity detection (VAD) powered by deep learning☆230Updated last week
- Efficient approach to speaker diarization using voice characteristics extraction☆101Updated 3 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆198Updated 7 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆24Updated last month
- Create an LJSpeech structured voice dataset on wave input☆35Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- On-device speaker diarization powered by deep learning☆54Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆125Updated last month