Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API ๐ฅ
โ1,685Jan 14, 2025Updated last year
Alternatives and similar repositories for awesome-openai-vision-api-experiments
Users that are interested in awesome-openai-vision-api-experiments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VLโ2,661Mar 30, 2026Updated last week
- โ6,746Jun 26, 2025Updated 9 months ago
- webcamGPT - chat with video stream ๐ฌ + ๐ธโ268Feb 22, 2024Updated 2 years ago
- We write your reusable computer vision tools. ๐โ37,644Apr 1, 2026Updated last week
- ๐๏ธ + ๐ฌ + ๐ง = ๐ค Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]โ637Feb 29, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail โข AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- โ12,423Oct 25, 2025Updated 5 months ago
- Browse the web with GPT-4V and Vimiumโ2,662Sep 25, 2024Updated last year
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures lโฆโ9,292Mar 27, 2026Updated 2 weeks ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.โ24,652Aug 12, 2024Updated last year
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMsโ1,524Aug 19, 2024Updated last year
- llama.cpp with BakLLaVA model describes what does it seeโ379Nov 8, 2023Updated 2 years ago
- โ9,659Oct 16, 2025Updated 5 months ago
- HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!"โ3,096Feb 16, 2024Updated 2 years ago
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skillsโ767Feb 1, 2024Updated 2 years ago
- Open source password manager - Proton Pass โข AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API ๐๐ฆโ61Nov 7, 2023Updated 2 years ago
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app โฆโ6,513Mar 23, 2026Updated 2 weeks ago
- โ4,166May 2, 2025Updated 11 months ago
- Crawl a site to generate knowledge files to create your own custom GPT from a URLโ22,227Jul 7, 2025Updated 9 months ago
- AI companions with memory: a lightweight stack to create and host your own AI companionsโ5,944Apr 23, 2024Updated last year
- Turn any computer or edge device into a command center for your computer vision projects.โ2,242Apr 4, 2026Updated last week
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wildโ4,754Nov 18, 2024Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languagesโ19,557Apr 3, 2026Updated last week
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agentsโ5,895Sep 26, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Consistency Distilled Diff VAEโ2,212Nov 7, 2023Updated 2 years ago
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.โ21,988Updated this week
- tiny vision language modelโ9,554Nov 14, 2025Updated 4 months ago
- A natural language interface for computersโ63,040Feb 9, 2026Updated 2 months ago
- Foundational Models for State-of-the-Art Speech and Text Translationโ11,769Updated this week
- Build ChatGPT over your data, all with natural languageโ6,528Apr 5, 2024Updated 2 years ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)โ12,798Mar 23, 2026Updated 2 weeks ago
- โ2,528Apr 3, 2024Updated 2 years ago
- ๐พ Open source implementation of the ChatGPT Code Interpreterโ3,855Nov 7, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways โข AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- a state-of-the-art-level open visual language model | ๅคๆจกๆ้ข่ฎญ็ปๆจกๅโ6,734May 29, 2024Updated last year
- structured outputs for llmsโ12,702Apr 3, 2026Updated last week
- OpenChat: Advancing Open-source Language Models with Imperfect Dataโ5,477Sep 13, 2024Updated last year
- Awesome things you can do with ChatGPT + Code Interpreter combo ๐ฅโ1,013Dec 10, 2023Updated 2 years ago
- โ5,118Mar 26, 2025Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.โ13,280Apr 4, 2026Updated last week
- Images to inference with no labeling (use foundation models to train supervised models).โ2,658May 14, 2025Updated 10 months ago