On-device streaming text-to-speech engine powered by deep learning
☆131Feb 24, 2026Updated last week
Alternatives and similar repositories for orca
Users that are interested in orca are comparing it to the libraries listed below
Sorting:
- On-device speaker recognition engine powered by deep learning☆41Feb 13, 2026Updated 2 weeks ago
- 内容审核及速率限制服务☆26May 18, 2025Updated 9 months ago
- On-device LLM Inference Powered by X-Bit Quantization☆300Feb 23, 2026Updated last week
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated last year
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆38Jan 22, 2026Updated last month
- On-device speaker diarization powered by deep learning☆69Updated this week
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- On-device streaming speech-to-text engine powered by deep learning☆660Updated this week
- On-device voice activity detection (VAD) powered by deep learning☆244Feb 13, 2026Updated 2 weeks ago
- ☆28Nov 7, 2023Updated 2 years ago
- ☆54Jul 16, 2025Updated 7 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Simple Summarizer Tool using Llama 3 8b.☆10May 14, 2024Updated last year
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆12Dec 21, 2024Updated last year
- 自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。☆12Dec 26, 2024Updated last year
- LLm Collaboration☆12Aug 23, 2024Updated last year
- A chat UI for Llama.cpp☆15Dec 2, 2025Updated 3 months ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- On-device noise suppression powered by deep learning☆83Updated this week
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- A simple implementation for improving CosyVoice2 by GRPO method☆32Oct 17, 2025Updated 4 months ago
- Parse lightning network payment requests (invoices) in Kotlin.☆17Feb 9, 2026Updated 3 weeks ago
- On-device speech-to-text engine powered by deep learning☆472Feb 26, 2026Updated last week
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated 10 months ago
- ☆14Aug 19, 2024Updated last year
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆38Feb 17, 2026Updated 2 weeks ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆59Jun 20, 2024Updated last year
- benchmark for Speech-to-Intent engines☆17Dec 18, 2025Updated 2 months ago
- ☆18May 27, 2025Updated 9 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆77Jul 19, 2024Updated last year
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Feb 2, 2026Updated last month
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated last month
- ☆18Jan 18, 2024Updated 2 years ago
- Joint speech-language model - respond directly to audio!☆372Jul 1, 2024Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Sep 21, 2022Updated 3 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- ☆204Sep 24, 2024Updated last year