Kaszebe / Large-Vision-Language-Model-UI
This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.
☆29Updated 5 months ago
Alternatives and similar repositories for Large-Vision-Language-Model-UI:
Users that are interested in Large-Vision-Language-Model-UI are comparing it to the libraries listed below
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated 2 months ago
- Deploy Apollo HF space locally☆40Updated 2 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆39Updated 2 months ago
- All the world is a play, we are but actors in it.☆47Updated this week
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆49Updated 5 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated 6 months ago
- ☆21Updated 4 months ago
- ☆46Updated 4 months ago
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆25Updated 2 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 8 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆29Updated this week
- A repository to store helpful information and emerging insights in regard to LLMs☆20Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- LLM backed Fantasy Tribe Game☆18Updated 3 months ago
- Experimental LLM Inference UX to aid in creative writing☆112Updated 3 months ago
- Demo of an "always-on" AI assistant.☆24Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 6 months ago
- ☆16Updated 2 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆53Updated 4 months ago
- Science-driven chatbot development☆56Updated 10 months ago
- Based on kylemcdonald/i2i-realtime. The warping server for GenDJ real time webcam AI warping☆27Updated 8 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆32Updated 7 months ago
- 100% Local Document deep search with LLMs☆26Updated 6 months ago
- ☆28Updated 5 months ago
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆17Updated 9 months ago
- Run Ollama LLM models in Google Colab for free☆33Updated 3 months ago
- Capture, tag, and search images locally with OSS models.☆40Updated last month