On-device streaming text-to-speech engine powered by deep learning
☆139May 12, 2026Updated 2 weeks ago
Alternatives and similar repositories for orca
Users that are interested in orca are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 内容审核及速率限制服务☆26May 18, 2025Updated last year
- On-device speaker recognition engine powered by deep learning☆46Updated this week
- On-device LLM Inference Powered by X-Bit Quantization☆312May 15, 2026Updated last week
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated last year
- On-device speaker diarization powered by deep learning☆72May 8, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- benchmark for Speech-to-Intent engines☆18Mar 27, 2026Updated last month
- On-device streaming speech-to-text engine powered by deep learning☆663Updated this week
- On-device voice activity detection (VAD) powered by deep learning☆253May 11, 2026Updated 2 weeks ago
- On-device noise suppression powered by deep learning☆88Updated this week
- Voice activity engine benchmark framework☆23Jan 14, 2026Updated 4 months ago
- On-device speech-to-text engine powered by deep learning☆478May 11, 2026Updated 2 weeks ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆46May 11, 2026Updated 2 weeks ago
- LLm Collaboration☆12Aug 23, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 3 months ago
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆14Dec 21, 2024Updated last year
- Recipes for on-device voice AI and local LLM☆111May 20, 2026Updated last week
- Speaker diarization benchmark framework☆40Jan 8, 2026Updated 4 months ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- A simple implementation for improving CosyVoice2 by GRPO method☆38May 5, 2026Updated 3 weeks ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- ☆203Sep 24, 2024Updated last year
- A chat UI for Llama.cpp☆16May 13, 2026Updated 2 weeks ago
- A bot that checks your grammar and phrasing using LLM of choice☆32Feb 6, 2025Updated last year
- ☆55Jul 16, 2025Updated 10 months ago
- ☆81Aug 11, 2025Updated 9 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 3 years ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆59Jun 20, 2024Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆269Jan 13, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆26Jul 2, 2024Updated last year
- ☆17May 2, 2024Updated 2 years ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- ☆28Nov 7, 2023Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆159Jan 27, 2026Updated 4 months ago
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch☆277Oct 30, 2023Updated 2 years ago