astramind-ai / AuralisView external linksLinks
A Fast TTS Engine
☆614Jan 23, 2025Updated last year
Alternatives and similar repositories for Auralis
Users that are interested in Auralis are comparing it to the libraries listed below
Sorting:
- Interface for OuteTTS models.☆1,421Jun 21, 2025Updated 7 months ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆2,247Jan 9, 2026Updated last month
- ☆54Jul 16, 2025Updated 7 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,079Updated this week
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆13Dec 8, 2025Updated 2 months ago
- Inference and training library for high-quality TTS models.☆5,528Dec 10, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆57Aug 9, 2025Updated 6 months ago
- first base model for full-duplex conversational audio☆1,773Jan 5, 2025Updated last year
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆4,403Jan 4, 2026Updated last month
- Towards Human-Sounding Speech☆5,944Dec 5, 2025Updated 2 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆3,142Updated this week
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 6 months ago
- SOTA Open Source TTS☆24,863Feb 2, 2026Updated last week
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆29May 6, 2025Updated 9 months ago
- An unofficial PyTorch implementation of VALL-E☆88Aug 3, 2025Updated 6 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆850Feb 2, 2025Updated last year
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- ☆15Nov 11, 2024Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Feb 9, 2026Updated last week
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆91Jul 23, 2025Updated 6 months ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,687May 27, 2025Updated 8 months ago
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆6,162Aug 10, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- Real-time end-to-end singing voice convertion☆23Nov 3, 2024Updated last year
- ☆2,935Updated this week
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆111Apr 1, 2024Updated last year
- An Open Source text-to-speech system built by inverting Whisper.☆4,555Dec 14, 2025Updated 2 months ago
- TTS with kokoro and onnx runtime☆2,371Jan 30, 2026Updated 2 weeks ago
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆7,160Mar 5, 2025Updated 11 months ago
- LIVA - Local Intelligent Voice Assistant☆61Aug 28, 2024Updated last year
- High quality text-to-speech based on StyleTTS 2.☆72Updated this week
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆887Dec 10, 2025Updated 2 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆7,186Dec 24, 2024Updated last year