astramind-ai / Auralis
A Fast TTS Engine
☆411 · Updated this week
Alternatives and similar repositories for Auralis:
Users who are interested in Auralis are comparing it to the libraries listed below:
- Interface for OuteTTS models. ☆899 · Updated last week
- Local SRT/LLM/TTS Voicechat ☆601 · Updated 3 months ago
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs ☆409 · Updated last week
- Open source inference code for Rev's model ☆364 · Updated last week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆540 · Updated last month
- Implementation of F5-TTS in MLX ☆448 · Updated last week
- https://hf.co/hexgrad/Kokoro-82M ☆240 · Updated this week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching ☆985 · Updated this week
- TTS with kokoro and onnx runtime ☆1,321 · Updated this week
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper. ☆80 · Updated 3 months ago
- ☆152 · Updated 2 months ago
- Whisper with Medusa heads ☆819 · Updated 3 weeks ago
- A lightweight end-to-end text-to-speech model ☆99 · Updated last month
- Have a natural voice conversation with an LLM ☆235 · Updated last month
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆867 · Updated 3 months ago
- podcastfy.ai gradio demo app ☆325 · Updated last month
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching ☆595 · Updated this week
- Excalidraw meets ComfyUI for LLMs ☆221 · Updated last week
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models ☆278 · Updated 3 weeks ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice. ☆395 · Updated 2 months ago
- ☆254 · Updated 4 months ago
- Generate accurate transcripts using Apple's MLX framework ☆363 · Updated last month
- Examples for Cerebrium Serverless GPUs ☆456 · Updated this week
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ… ☆80 · Updated this week
- Turn local files into a prompt for an LLM ☆160 · Updated last week
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi… ☆250 · Updated last week
- Gemini Multimodal Live + WebRTC in a single `app.ts` ☆175 · Updated last month
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now) ☆229 · Updated last month
- An open-source implementation of NotebookLM using Deepseek-V3 and PlayHT TTS. ☆209 · Updated 3 weeks ago
- first base model for full-duplex conversational audio ☆1,691 · Updated 3 weeks ago