Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API š„
ā1,683Jan 14, 2025Updated last year
Alternatives and similar repositories for awesome-openai-vision-api-experiments
Users that are interested in awesome-openai-vision-api-experiments are comparing it to the libraries listed below
Sorting:
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VLā2,659Feb 23, 2026Updated last week
- ā6,753Jun 26, 2025Updated 8 months ago
- webcamGPT - chat with video stream š¬ + šøā268Feb 22, 2024Updated 2 years ago
- We write your reusable computer vision tools. šā36,612Updated this week
- šļø + š¬ + š§ = š¤ Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]ā637Feb 29, 2024Updated 2 years ago
- ā8,818Oct 25, 2025Updated 4 months ago
- Browse the web with GPT-4V and Vimiumā2,667Sep 25, 2024Updated last year
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures lā¦ā9,218Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.ā24,500Aug 12, 2024Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skillsā763Feb 1, 2024Updated 2 years ago
- HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!"ā3,093Feb 16, 2024Updated 2 years ago
- ā9,666Oct 16, 2025Updated 4 months ago
- ā4,169May 2, 2025Updated 10 months ago
- llama.cpp with BakLLaVA model describes what does it seeā379Nov 8, 2023Updated 2 years ago
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app ā¦ā6,422Feb 3, 2026Updated last month
- Turn any computer or edge device into a command center for your computer vision projects.ā2,205Updated this week
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agentsā5,876Sep 26, 2024Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API šš¦ā62Nov 7, 2023Updated 2 years ago
- Build ChatGPT over your data, all with natural languageā6,534Apr 5, 2024Updated last year
- AI companions with memory: a lightweight stack to create and host your own AI companionsā5,941Apr 23, 2024Updated last year
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)ā12,734Feb 9, 2026Updated 3 weeks ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wildā4,716Nov 18, 2024Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languagesā19,360Feb 24, 2026Updated last week
- Crawl a site to generate knowledge files to create your own custom GPT from a URLā22,185Jul 7, 2025Updated 7 months ago
- tiny vision language modelā9,364Nov 14, 2025Updated 3 months ago
- OpenChat: Advancing Open-source Language Models with Imperfect Dataā5,475Sep 13, 2024Updated last year
- ā2,525Apr 3, 2024Updated last year
- š¾ Open source implementation of the ChatGPT Code Interpreterā3,860Nov 7, 2024Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translationā11,762Nov 14, 2024Updated last year
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMsā1,517Aug 19, 2024Updated last year
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.ā21,340Feb 24, 2026Updated last week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.ā4,049Jan 8, 2025Updated last year
- Awesome things you can do with ChatGPT + Code Interpreter combo š„ā1,017Dec 10, 2023Updated 2 years ago
- structured outputs for llmsā12,468Feb 25, 2026Updated last week
- A natural language interface for computersā62,427Feb 9, 2026Updated 3 weeks ago
- Large Action Model framework to develop AI Web Agentsā6,311Jan 21, 2025Updated last year
- Consistency Distilled Diff VAEā2,209Nov 7, 2023Updated 2 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.ā13,206Updated this week
- Collection of all the GPTs created by the communityā1,356Apr 21, 2024Updated last year