astramind-ai / Auralis
A Fast TTS Engine
☆377Updated this week
Alternatives and similar repositories for Auralis:
Users that are interested in Auralis are comparing it to the libraries listed below
- Interface for OuteTTS models.☆763Updated this week
- Local SRT/LLM/TTS Voicechat☆566Updated 2 months ago
- Open source inference code for Rev's model☆343Updated last month
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆233Updated this week
- Implementation of F5-TTS in MLX☆389Updated this week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆468Updated 3 months ago
- podcastfy.ai gradio demo app☆320Updated 2 weeks ago
- ☆247Updated 3 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆70Updated 2 months ago
- ⚡ Insanely fast AI voice assistant with <500ms response times☆334Updated 2 weeks ago
- Examples for Cerebrium Serverless GPUs☆451Updated last week
- Have a natural voice conversation with an LLM☆231Updated last week
- Voice Transformation for Videos. 🎤👄🎬☆221Updated 2 months ago
- Whisper with Medusa heads☆807Updated this week
- Generate accurate transcripts using Apple's MLX framework☆339Updated last week
- ☆147Updated 2 weeks ago
- Text-to-speech API endpoint compatible with OpenAI's TTS API endpoint, using Microsoft Edge TTS to generate speech for free locally☆180Updated 2 weeks ago
- Use OpenAI's realtime API for a chatting with your documents☆283Updated 2 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆844Updated last month
- A lightweight end-to-end text-to-speech model☆93Updated this week
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆208Updated last month
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆209Updated last week
- ☆128Updated last month
- AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and …☆452Updated this week
- Efficient visual programming for AI language models☆313Updated 3 months ago
- Yet another open source Perplexity☆386Updated last month
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆340Updated last month
- Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses l…☆358Updated last month
- ☆300Updated 5 months ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆382Updated last month