Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.
☆17Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for Small-Multimodal-Vision-Model
Users that are interested in Small-Multimodal-Vision-Model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo lets you run mistral-7b in Google Colab.☆16Oct 1, 2023Updated 2 years ago
- Live audio chats with AI using Groq Llama3-70b and Deepgram Voice☆32Apr 24, 2024Updated 2 years ago
- PaliGemma Inference and Fine Tuning☆13May 15, 2024Updated 2 years ago
- Simple LLM interface based on terminal.☆12Jan 4, 2024Updated 2 years ago
- Repository for the companion Colab notebook of the Domain-Specific Small Language Models book.☆38May 11, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simple Chainlit UI for running llms from Groq and LangChain☆17Feb 28, 2024Updated 2 years ago
- Qwen2-VL for OCR & VQA☆19Sep 3, 2024Updated last year
- Agent with vision ability via llava & autogen☆75Oct 16, 2023Updated 2 years ago
- ☆17Apr 18, 2025Updated last year
- local whisper input by Whisper or SenseVoice/FunASR☆22Mar 5, 2025Updated last year
- MCP server for AI image generation and editing using Google's Gemini Flash models. Create images from text prompts with intelligent filen…☆33Mar 15, 2026Updated 2 months ago
- Dataset for the investigation of visual semiotics, and how specific visual features and design choices can elicit specific emotions, thou…☆10Dec 13, 2023Updated 2 years ago
- Okra, your all in one personal AI assistant☆14Jun 14, 2024Updated last year
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Make tool-calling schemas for existing tools☆14Mar 8, 2025Updated last year
- ☆18Oct 4, 2025Updated 7 months ago
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated 11 months ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆23Dec 2, 2025Updated 5 months ago
- Text-to-video generation.☆19Jul 18, 2022Updated 3 years ago
- Democratizing Function Calling Capabilities for Open-Source Language Models☆42May 5, 2024Updated 2 years ago
- This curated list highlights the latest breakthroughs in EEG and AI integration, providing a user-friendly guide for researchers, student…☆24Dec 26, 2024Updated last year
- Time series and Financial analysis in python☆14Mar 28, 2019Updated 7 years ago
- ☆15Mar 6, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Mar 26, 2022Updated 4 years ago
- ☆31Apr 5, 2025Updated last year
- This example shows how you can use Sandpack and firepad-x to build a collbrative text editor.☆23Jul 29, 2022Updated 3 years ago
- using g4f & embedding tools to mock openai server☆12Aug 20, 2023Updated 2 years ago
- A library to convert Pydantic models to TypedDict☆40Aug 17, 2024Updated last year
- Promptrix is a prompt layout engine for Large Language Models.☆80Nov 25, 2024Updated last year
- A simple dify bot☆34Apr 16, 2025Updated last year
- ☆33May 22, 2024Updated last year
- ⚡ Building applications with LLMs through composability ⚡☆14Mar 10, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Talking head video AI generator☆83Jan 14, 2024Updated 2 years ago
- AI SaaS Platform with Next.js 13, React, Tailwind, Prisma, Stripe, Clerk, OpenAPI, Replicate, PlanetScale, MySQL, TypeScript & Crisp.☆23Aug 30, 2024Updated last year
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆35Oct 7, 2024Updated last year
- A drag-and-drop-enabled, responsive, envelope graph that allows to shape a wave with attack, decay, sustain and release☆11Jan 5, 2023Updated 3 years ago
- rUv-Engineer - let's you describe UI using your imagination, then see it rendered live.☆12Sep 28, 2024Updated last year
- ☆88Mar 7, 2024Updated 2 years ago
- Get the latest news about any topic or entity using GPT-4 and AYLIEN News API.☆31Feb 12, 2024Updated 2 years ago