Reqeique / DimitsLinks
☆15Updated 10 months ago
Alternatives and similar repositories for Dimits
Users that are interested in Dimits are comparing it to the libraries listed below
Sorting:
- API server for Instant voice cloning by MyShell.☆107Updated last year
- whisper.cpp bindings for python☆110Updated 2 years ago
- Python bindings for whisper.cpp☆249Updated last year
- Pybind11 bindings for Whisper.cpp☆62Updated this week
- Python bindings for whisper.cpp☆321Updated last month
- On-device streaming text-to-speech engine powered by deep learning☆128Updated 2 weeks ago
- Python package wrapping llama.cpp for on-device LLM inference☆100Updated 3 months ago
- offline text to speech and free SOTA LLM APIs to let your programs speak to you☆46Updated 2 weeks ago
- Local voice recording for creating Piper datasets☆203Updated 7 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆712Updated 7 months ago
- ☆207Updated last year
- A random walk voice style cloning application for Kokoro text to speech☆205Updated 7 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆90Updated last year
- Docker configuration for koboldcpp☆41Updated last year
- IRIS: Demonstrator for use of LLMs in python (outdated)☆63Updated 10 months ago
- Frontier Open-Source Text-to-Speech☆110Updated 4 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated last year
- ☆100Updated last year
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆45Updated 2 years ago
- A simple FastAPI Server to run XTTSv2☆572Updated last year
- Simulates talk with an AI that can express emotions☆82Updated 7 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆539Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆266Updated 11 months ago
- XTTSv2 Extension for oobabooga text-generation-webui☆156Updated 2 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- An OpenAI API compatible image generation server for the FLUX.1 family of models from Black Forest Labs☆60Updated last year
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web☆275Updated 3 weeks ago
- Voice models for Mimic 3 text to speech system☆161Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆216Updated last year
- ☆13Updated 10 months ago