Kaszebe / Large-Vision-Language-Model-UI
This repo provides a simple Gradio UI for running Qwen2 VL 72B AWQ in a venv, with both image and video inference working.
☆26 Updated 3 months ago
Alternatives and similar repositories for Large-Vision-Language-Model-UI:
Users who are interested in Large-Vision-Language-Model-UI are comparing it to the libraries listed below.
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 Updated 5 months ago
- ☆43 Updated 2 months ago
- Deploy Apollo HF space locally ☆39 Updated last month
- ☆27 Updated 3 months ago
- ☆20 Updated 3 months ago
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma… ☆17 Updated 7 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆85 Updated 5 months ago
- ☆29 Updated last month
- [WIP] AI Try-On plugin for Chrome ☆27 Updated 10 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆31 Updated 6 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management. ☆22 Updated 4 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory! ☆32 Updated last month
- Gradio based tool to run opensource LLM models directly from Huggingface ☆90 Updated 7 months ago
- Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interf… ☆33 Updated 2 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others. ☆29 Updated this week
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,… ☆41 Updated 3 months ago
- Create text chunks which end at natural stopping points without using a tokenizer ☆26 Updated last month
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models … ☆30 Updated 6 months ago
- Mistral7B playing DOOM ☆28 Updated 10 months ago
- run ollama & gguf easily with a single command ☆49 Updated 8 months ago
- Writing Extension for Text Generation WebUI ☆45 Updated 2 weeks ago
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of… ☆49 Updated 3 months ago
- LLM backed Fantasy Tribe Game ☆18 Updated 2 months ago
- All the world is a play, we are but actors in it. ☆47 Updated this week
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ☆18 Updated last month
- A unified library for interacting with various AI APIs through a standardized interface. ☆27 Updated this week
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆50 Updated 3 months ago
- ☆109 Updated last month
- ☆39 Updated this week
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the … ☆49 Updated last month