Kaszebe / Large-Vision-Language-Model-UI
This repo provides a simple Gradio UI for running Qwen2 VL 72B AWQ in a venv, with both image and video inference working.
☆30Updated 11 months ago
Alternatives and similar repositories for Large-Vision-Language-Model-UI
Users who are interested in Large-Vision-Language-Model-UI are comparing it to the libraries listed below.
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆25Updated 5 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 4 months ago
- Deploy Apollo HF space locally☆40Updated 9 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated last year
- ☆51Updated 10 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆22Updated 5 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆43Updated 3 months ago
- Gradio-based tool to run open-source LLMs directly from Hugging Face☆95Updated last year
- Demo of an "always-on" AI assistant.☆24Updated last year
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆44Updated 8 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated last year
- A simple experiment on letting two local LLMs have a conversation about anything!☆112Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆39Updated last year
- ☆24Updated 7 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆52Updated 7 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆33Updated this week
- Real-time TTS reading of large text files by your favourite voice. +Translation via LLM (Python script)☆52Updated 10 months ago
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆28Updated 4 months ago
- LLM backed Fantasy Tribe Game☆19Updated 9 months ago
- ☆23Updated 10 months ago
- An API for VoiceCraft.☆25Updated last year
- [WIP] AI Try-On plugin for Chrome☆28Updated last year
- Personal voice assistant, with voice interruption and Twilio support☆18Updated 6 months ago
- ☆116Updated 8 months ago
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆53Updated 11 months ago
- ☆22Updated last year
- Something similar to Apple Intelligence?☆61Updated last year
- Experimental LLM Inference UX to aid in creative writing☆121Updated 9 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆64Updated 5 months ago
- Run Ollama & GGUF easily with a single command☆52Updated last year