On-device streaming text-to-speech engine powered by deep learning
☆139Jun 10, 2026Updated this week
Alternatives and similar repositories for orca
Users that are interested in orca are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 内容审核及速率限制服务☆26May 18, 2025Updated last year
- On-device LLM Inference Powered by X-Bit Quantization☆312Updated this week
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated last year
- On-device speaker diarization powered by deep learning☆73Updated this week
- On-device streaming speech-to-text engine powered by deep learning☆664Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- On-device voice activity detection (VAD) powered by deep learning☆260Updated this week
- On-device noise suppression powered by deep learning☆90Updated this week
- Voice activity engine benchmark framework☆23Jan 14, 2026Updated 5 months ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆47May 11, 2026Updated last month
- Master Your Money: Effortless Tracking and Smarter Spending☆12Mar 17, 2024Updated 2 years ago
- LLm Collaboration☆12Aug 23, 2024Updated last year
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 3 months ago
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆14Dec 21, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- On-device AI blueprints for real‑time voice, language, and vision understanding☆111Updated this week
- Parse lightning network payment requests (invoices) in Kotlin.☆21May 27, 2026Updated 2 weeks ago
- Speaker diarization benchmark framework☆40Jan 8, 2026Updated 5 months ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- On-device Speech-to-Intent engine powered by deep learning☆703Updated this week
- A simple implementation for improving CosyVoice2 by GRPO method☆38May 5, 2026Updated last month
- A Telegram bot which generates your intro video programmatically 📽️☆21Feb 29, 2024Updated 2 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated last year
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆18Jun 9, 2024Updated 2 years ago
- ☆203Sep 24, 2024Updated last year
- A chat UI for Llama.cpp☆16Jun 4, 2026Updated last week
- A bot that checks your grammar and phrasing using LLM of choice☆33Feb 6, 2025Updated last year
- ☆55Jul 16, 2025Updated 11 months ago
- ☆29May 14, 2026Updated last month
- ☆81Aug 11, 2025Updated 10 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The Low Latency CDN Powered by Firebase☆22Mar 6, 2021Updated 5 years ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆59Jun 20, 2024Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆269Jan 13, 2025Updated last year
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆26Jul 2, 2024Updated last year
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 5 years ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year