ysharma3501 / FastMayaLinks
A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.
☆53Updated last month
Alternatives and similar repositories for FastMaya
Users that are interested in FastMaya are comparing it to the libraries listed below
Sorting:
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆85Updated last month
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆298Updated 2 months ago
- Kyutai with an "eye"☆230Updated 9 months ago
- ☆329Updated 3 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆25Updated 8 months ago
- A high quality and fast TTS repository☆111Updated last week
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆304Updated 6 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆248Updated 6 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 7 months ago
- VLLM Port of the Chatterbox TTS model☆354Updated 2 months ago
- A web application that converts speech to speech 100% private☆81Updated 6 months ago
- ☆532Updated 2 months ago
- List of curated use cases built using Sesame's CSM 1B☆73Updated 6 months ago
- ☆374Updated last month
- A random walk voice style cloning application for Kokoro text to speech☆189Updated 6 months ago
- Create Unmute voice embeddings☆23Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆218Updated 8 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆64Updated 9 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 7 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- Chatbot-to-speech using Orpheus TTS model. Interactive console app.☆20Updated 7 months ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆31Updated last year
- ☆241Updated this week
- kokoro text to speech using javascript☆63Updated 10 months ago
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.☆76Updated 7 months ago
- Very fast, accurate speaker diarization☆193Updated this week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆346Updated 8 months ago
- The official GitHub Page for MiniMax☆60Updated last month
- Service for testing out the new Qwen2.5 omni model☆61Updated 7 months ago