Kaszebe / Large-Vision-Language-Model-UILinks

This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.

☆30

Alternatives and similar repositories for Large-Vision-Language-Model-UI

Users that are interested in Large-Vision-Language-Model-UI are comparing it to the libraries listed below

Sorting:

efogdev / apollo
Deploy Apollo HF space locally
☆40Updated 7 months ago
PasiKoodaa / dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆31Updated 3 months ago
cocktailpeanut / hallucinator
☆51Updated 8 months ago
zenforic / csm-multi
Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…
☆23Updated 4 months ago
PasiKoodaa / ACE-Step-RADIO
ACE-Step: A Step Towards Music Generation Foundation Model
☆42Updated 2 months ago
akashjss / orpheus-tts-local-webui
Run Orpheus 3B Locally with Gradio UI, Standalone App
☆23Updated 4 months ago
severian42 / Computational-Model-for-Symbolic-Representations
Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …
☆49Updated 5 months ago
parsakhaz / gaze-detection-video
Use the Moondream 2 model to detect faces and their gaze directions in videos.
☆44Updated 6 months ago
mounta11n / plusplus-camall
After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…
☆54Updated 11 months ago
abgulati / hf-waitress
Serving LLMs in the HF-Transformers format via a PyFlask API
☆71Updated 10 months ago
dynamiccreator / voice-text-reader
Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)
☆52Updated 9 months ago
LAION-AI / Desktop_BUD-E
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆36Updated last year
cocktailpeanut / deeperhermes
deep hermes, but decides how to respond based on its OWN decision, no need for system prompts.
☆39Updated 4 months ago
jabberjabberjabber / Chunkify
Create text chunks which end at natural stopping points without using a tokenizer
☆26Updated 4 months ago
5aharsh / collama
Run Ollama LLM models in Google Colab for free
☆36Updated 8 months ago
bdambrosio / AllTheWorldAPlay
All the world is a play, we are but actors in it.
☆50Updated last week
yelboudouri / SwitchAI
A unified library for interacting with various AI APIs through a standardized interface.
☆31Updated 4 months ago
ivoras / llmtalkie
A micro LLM multi-agent system for data analysis
☆19Updated 3 months ago
astramind-ai / Pulsar
The hearth of The Pulsar App, fast, secure and shared inference with modern UI
☆55Updated 8 months ago
Antoine-Villiere / JacQues
JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.
☆22Updated 10 months ago
thooton / aspen
Personal voice assistant, with voice interruption and Twilio support
☆18Updated 5 months ago
Fus3n / TwoAI
A simple experiment on letting two local LLM have a conversation about anything!
☆110Updated last year
PkmX / orpheus-chat-webui
Orpheus Chat WebUI
☆69Updated 4 months ago
CalvesGEH / VoiceCraftAPI
An API for VoiceCraft.
☆25Updated last year
charmandercha / ArchiDoc
☆17Updated 7 months ago
rodrigobaron / anthill
☆24Updated 6 months ago
Aesthisia / LLMinator
Gradio based tool to run opensource LLM models directly from Huggingface
☆94Updated last year
AndrewVeee / assistant-demo
Demo of an "always-on" AI assistant.
☆24Updated last year
menloresearch / ichigo-demo
☆91Updated 2 months ago
stringandstickytape / MaxsAiStudio
A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.
☆33Updated last week