Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.
☆17Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for Small-Multimodal-Vision-Model
Users that are interested in Small-Multimodal-Vision-Model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- code for "Automated and Intelligent Synthesis of Oxygen-Producing Catalysts from Martian Meteorites by Robotic AI-Chemist "☆12Jul 31, 2023Updated 2 years ago
- Auto-Video maker handling many AI's☆11Mar 18, 2024Updated 2 years ago
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆18Jan 15, 2024Updated 2 years ago
- Live audio chats with AI using Groq Llama3-70b and Deepgram Voice☆32Apr 24, 2024Updated 2 years ago
- Simple LLM interface based on terminal.☆12Jan 4, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A simple demo application showcasing the power of Gemini 1.5 Pro's video understanding capabilities.☆31May 24, 2024Updated 2 years ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Jun 5, 2026Updated 3 weeks ago
- Qwen2-VL for OCR & VQA☆19Sep 3, 2024Updated last year
- Agent with vision ability via llava & autogen☆75Oct 16, 2023Updated 2 years ago
- ☆17Apr 18, 2025Updated last year
- Okra, your all in one personal AI assistant☆14Jun 14, 2024Updated 2 years ago
- Dataset for the investigation of visual semiotics, and how specific visual features and design choices can elicit specific emotions, thou…☆10Dec 13, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆21Nov 3, 2025Updated 7 months ago
- Research Paper: Fuzzy Model Identification Based on Cluster Estimation☆10Jun 1, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Make tool-calling schemas for existing tools☆14Mar 8, 2025Updated last year
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated last year
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆25Dec 2, 2025Updated 6 months ago
- Text-to-video generation.☆20Jul 18, 2022Updated 3 years ago
- Democratizing Function Calling Capabilities for Open-Source Language Models☆43May 5, 2024Updated 2 years ago
- This curated list highlights the latest breakthroughs in EEG and AI integration, providing a user-friendly guide for researchers, student…☆24Dec 26, 2024Updated last year
- A proxy for Google Bard LLM☆10Nov 2, 2023Updated 2 years ago
- ☆15Mar 6, 2026Updated 3 months ago
- ☆18Mar 26, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Experiments with LAVIS library to perform image2text and text2image retrieval with BLIP and BLIP2 models☆15Sep 25, 2023Updated 2 years ago
- using g4f & embedding tools to mock openai server☆12Aug 20, 2023Updated 2 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- A guide to structured generation using constrained decoding☆18Jun 9, 2024Updated 2 years ago
- The browser extension of SydneyQt that enables multiple shortcuts, including resolve CAPTCHA automatically etc.☆10Jan 27, 2024Updated 2 years ago
- ☆33May 22, 2024Updated 2 years ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- ⚡ Building applications with LLMs through composability ⚡☆14Mar 10, 2023Updated 3 years ago
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆35Oct 7, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Gemini Bot is a Telegram chatbot powered by Vertex AI's generative models. This Python implementation utilizes the Telethon library to in…☆12Feb 17, 2024Updated 2 years ago
- rUv-Engineer - let's you describe UI using your imagination, then see it rendered live.☆13Sep 28, 2024Updated last year
- ☆88Mar 7, 2024Updated 2 years ago
- Get the latest news about any topic or entity using GPT-4 and AYLIEN News API.☆31Feb 12, 2024Updated 2 years ago
- 网页newbing ai转api调用库,优化对话模式☆13Feb 11, 2025Updated last year
- OpenClaw Operator gives coding agents like Codex and Claude Code the context and playbooks needed to set up, validate, and troubleshoot a…☆20Mar 7, 2026Updated 3 months ago
- A comfyui costume node by BillBum for using api gen (VLM LLM T2I API Tools)☆10May 26, 2026Updated last month