Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.
☆17Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for Small-Multimodal-Vision-Model
Users that are interested in Small-Multimodal-Vision-Model are comparing it to the libraries listed below
Sorting:
- Auto-Video maker handling many AI's☆11Mar 18, 2024Updated 2 years ago
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆18Jan 15, 2024Updated 2 years ago
- Simple LLM interface based on terminal.☆12Jan 4, 2024Updated 2 years ago
- Simple Chainlit UI for running llms from Groq and LangChain☆17Feb 28, 2024Updated 2 years ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Feb 4, 2024Updated 2 years ago
- Qwen2-VL for OCR & VQA☆19Sep 3, 2024Updated last year
- Agent with vision ability via llava & autogen☆74Oct 16, 2023Updated 2 years ago
- Dataset for the investigation of visual semiotics, and how specific visual features and design choices can elicit specific emotions, thou…☆10Dec 13, 2023Updated 2 years ago
- Okra, your all in one personal AI assistant☆14Jun 14, 2024Updated last year
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- Research Paper: Fuzzy Model Identification Based on Cluster Estimation☆10Jun 1, 2021Updated 4 years ago
- Make tool-calling schemas for existing tools☆14Mar 8, 2025Updated last year
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated 9 months ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆21Dec 2, 2025Updated 3 months ago
- Democratizing Function Calling Capabilities for Open-Source Language Models☆42May 5, 2024Updated last year
- ☆15Mar 6, 2026Updated 2 weeks ago
- ☆32May 22, 2024Updated last year
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- using g4f & embedding tools to mock openai server☆12Aug 20, 2023Updated 2 years ago
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆49Mar 13, 2025Updated last year
- The browser extension of SydneyQt that enables multiple shortcuts, including resolve CAPTCHA automatically etc.☆10Jan 27, 2024Updated 2 years ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- ClawLess — A serverless browser-based runtime for Claw AI Agents powered by WebContainers☆87Updated this week
- ⚡ Building applications with LLMs through composability ⚡☆14Mar 10, 2023Updated 3 years ago
- A simple dify bot☆34Apr 16, 2025Updated 11 months ago
- Experience the power of Generative Pretrained Transformers with a user-friendly interface.☆14May 23, 2025Updated 9 months ago
- A drag-and-drop-enabled, responsive, envelope graph that allows to shape a wave with attack, decay, sustain and release☆11Jan 5, 2023Updated 3 years ago
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆35Oct 7, 2024Updated last year
- Gemini Bot is a Telegram chatbot powered by Vertex AI's generative models. This Python implementation utilizes the Telethon library to in…☆12Feb 17, 2024Updated 2 years ago
- rUv-Engineer - let's you describe UI using your imagination, then see it rendered live.☆11Sep 28, 2024Updated last year
- ☆42Mar 10, 2026Updated last week
- Get the latest news about any topic or entity using GPT-4 and AYLIEN News API.☆31Feb 12, 2024Updated 2 years ago
- 网页newbing ai转api调用库,优化对话模式☆12Feb 11, 2025Updated last year
- PowerShell integration for Google's versatile Gemini Pro API☆20Dec 20, 2023Updated 2 years ago
- One-shot logo detection on images. Implementation of the paper "A Deep One-Shot Network for Query-based LogoRetrieval" (Bhunia et al. 201…☆22Jun 18, 2024Updated last year
- A comfyui costume node by BillBum for using api gen (VLM LLM T2I API Tools)☆10Feb 4, 2026Updated last month
- Rivet plugin for integration with Ollama, the tool for running LLMs locally easily☆43Jun 5, 2025Updated 9 months ago
- 🧠 A curated list of awesome ChatGPT resources, including libraries, SDKs, APIs, and more. 🌟 Please consider supporting this project by …☆12Apr 11, 2023Updated 2 years ago
- A set of Custom Nodes for Compositing for ComfyUI☆14Nov 24, 2024Updated last year