Kaszebe / Large-Vision-Language-Model-UI
This repo provides a simple Gradio UI for running Qwen2 VL 72B AWQ in a venv, with both image and video inference working.
☆17 · Updated last month
Related projects
Alternatives and complementary repositories for Large-Vision-Language-Model-UI
- 4 million public Stable Diffusion prompts -- interactive neural search and llama chat ☆19 · Updated 2 months ago
- Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interf… ☆29 · Updated last week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 · Updated 2 months ago
- Demo of an "always-on" AI assistant. ☆23 · Updated 8 months ago
- Automated LLM novelist ☆35 · Updated 7 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes. ☆38 · Updated 8 months ago
- A simple extension that uses Bark Text-to-Speech for audio output ☆35 · Updated last year
- The heart of The Pulsar App: fast, secure and shared inference with a modern UI ☆35 · Updated 2 weeks ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,… ☆31 · Updated last month
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the … ☆34 · Updated 2 weeks ago
- MFLUX-WEBUI using MLX and the FLUX DEV and Schnell models ☆21 · Updated last week
- Crow is a Desktop AI Assistant ☆27 · Updated 3 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others. ☆27 · Updated last week
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) in Python (Gradio interface). Translated on … ☆71 · Updated this week
- Diffusion_TTS extension for booga ☆63 · Updated 4 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to … ☆35 · Updated 2 months ago
- API server for Instant voice cloning by MyShell. ☆69 · Updated last month
- Overide (pronounced over·ide) is a lightweight, yet powerful CLI tool that seamlessly integrates AI-powered code generation into your dev… ☆93 · Updated this week
- Windows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆18 · Updated 6 months ago
- LLM backed Fantasy Tribe Game ☆17 · Updated last week
- Intuitive basic interface for interacting with multiple LLMs at the same time ☆33 · Updated 3 weeks ago
- All the world is a play, we are but actors in it. ☆47 · Updated 4 months ago
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of… ☆46 · Updated last month
- Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF ☆32 · Updated 7 months ago
- Mistral7B playing DOOM ☆27 · Updated 7 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆81 · Updated 2 months ago