Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆349Apr 10, 2025Updated 10 months ago
Alternatives and similar repositories for orpheus-cpp
Users that are interested in orpheus-cpp are comparing it to the libraries listed below
Sorting:
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Mar 16, 2025Updated 11 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆663Jul 5, 2025Updated 7 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated 11 months ago
- Interface for OuteTTS models.☆1,426Jun 21, 2025Updated 8 months ago
- ☆15Feb 1, 2025Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated 11 months ago
- ☆17Apr 9, 2025Updated 10 months ago
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆700Jul 10, 2025Updated 7 months ago
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated 11 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆147May 18, 2025Updated 9 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- Towards Human-Sounding Speech☆5,968Dec 5, 2025Updated 2 months ago
- mnn tts demo.☆19May 7, 2025Updated 9 months ago
- ☆475May 19, 2025Updated 9 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆289Apr 14, 2025Updated 10 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆41Jan 27, 2026Updated last month
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆397Aug 15, 2025Updated 6 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆308May 31, 2025Updated 9 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆94Oct 8, 2025Updated 4 months ago
- A random walk voice style cloning application for Kokoro text to speech☆213Jun 16, 2025Updated 8 months ago
- An interface that features barely zero external dependencies beyond the Ollama API itself, making it lightweight and portable to easily i…☆12Mar 25, 2025Updated 11 months ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- Orpheus Chat WebUI☆76Mar 27, 2025Updated 11 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,863Jan 26, 2026Updated last month
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Dec 11, 2025Updated 2 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated 10 months ago
- Official implementation of the TTS model Lina-Speech☆178Jan 9, 2025Updated last year
- The python library for real-time communication☆4,535Jan 12, 2026Updated last month
- Streaming and Fine-tuning for Chatterbox TTS☆276Jun 15, 2025Updated 8 months ago
- ☆26Jan 15, 2025Updated last year
- Use smol agents to do research and then update csv coumns with its findings.☆41Jan 30, 2025Updated last year
- Transfer learning approach to pronunciation scoring☆11Jan 17, 2024Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57May 17, 2025Updated 9 months ago
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆22Aug 5, 2025Updated 6 months ago