3eeps / llmon-pyLinks
llmon-py is a multimodal webui for Llama 3-8B.
☆16Updated last year
Alternatives and similar repositories for llmon-py
Users that are interested in llmon-py are comparing it to the libraries listed below
Sorting:
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 5 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆43Updated last month
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆25Updated 7 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- ☆50Updated last week
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Updated last year
- Sophia AI Assistant is a Python-based desktop AI that performs a variety of tasks, including answering questions, opening applications, b…☆22Updated last year
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆64Updated 7 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- Streaming and Fine-tuning for Chatterbox TTS☆204Updated 4 months ago
- Open TTS models, built for streaming on the edge☆43Updated 7 months ago
- A random walk voice style cloning application for Kokoro text to speech☆152Updated 4 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last week
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆28Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆25Updated last month
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 10 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆128Updated 2 months ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆37Updated 9 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆15Updated 5 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆77Updated 11 months ago
- SoTA open-source TTS☆103Updated 4 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆42Updated last year
- ☆51Updated 8 months ago
- ☆99Updated last year
- Examples of using the llasa-tts models locally☆181Updated 6 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year