A Fast TTS Engine
☆614Jan 23, 2025Updated last year
Alternatives and similar repositories for Auralis
Users who are interested in Auralis are comparing it to the libraries listed below
- Interface for OuteTTS models.☆1,427Jun 21, 2025Updated 8 months ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆2,265Jan 9, 2026Updated 2 months ago
- ☆54Jul 16, 2025Updated 7 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,169Updated this week
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆13Dec 8, 2025Updated 3 months ago
- Inference and training library for high-quality TTS models.☆5,547Dec 10, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- This repository implements a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆57Aug 9, 2025Updated 7 months ago
- first base model for full-duplex conversational audio☆1,783Jan 5, 2025Updated last year
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆4,526Jan 4, 2026Updated 2 months ago
- Towards Human-Sounding Speech☆5,983Dec 5, 2025Updated 3 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 7 months ago
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆29May 6, 2025Updated 10 months ago
- An unofficial PyTorch implementation of VALL-E☆88Aug 3, 2025Updated 7 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆853Feb 2, 2025Updated last year
- SOTA Open Source TTS☆25,154Updated this week
- ☆15Nov 11, 2024Updated last year
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Updated this week
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆91Jul 23, 2025Updated 7 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,706May 27, 2025Updated 9 months ago
- ☆19Mar 22, 2024Updated last year
- Real-time end-to-end singing voice conversion☆24Nov 3, 2024Updated last year
- ☆3,002Mar 2, 2026Updated last week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆6,196Aug 10, 2024Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆111Apr 1, 2024Updated last year
- An Open Source text-to-speech system built by inverting Whisper.☆4,568Dec 14, 2025Updated 2 months ago
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆7,193Mar 5, 2025Updated last year
- LIVA - Local Intelligent Voice Assistant☆61Aug 28, 2024Updated last year
- High quality text-to-speech based on StyleTTS 2.☆73Feb 25, 2026Updated last week
- TTS with kokoro and onnx runtime☆2,398Jan 30, 2026Updated last month
- Fast and accurate automatic speech recognition (ASR) for edge devices☆6,884Updated this week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆7,232Dec 24, 2024Updated last year
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…☆3,951Aug 14, 2025Updated 6 months ago