roboflow / awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
☆1,647Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-openai-vision-api-experiments
- streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL☆1,390Updated this week
- Set-of-Mark Prompting for GPT-4V and LMMs☆1,185Updated 3 months ago
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥☆999Updated 11 months ago
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]☆577Updated 8 months ago
- A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and …☆1,370Updated this week
- This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code]☆639Updated 4 months ago
- Images to inference with no labeling (use foundation models to train supervised models).☆1,989Updated 2 weeks ago
- Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building age…☆997Updated 5 months ago
- ☆700Updated 8 months ago
- Vision utilities for web interaction agents 👀☆1,450Updated this week
- 🥷 Run AI-agents with an API☆5,321Updated last month
- Agent techniques to augment your LLM and push it beyong its limits☆1,545Updated 5 months ago
- ☆6,481Updated 2 months ago
- Top ranked OpenAI GPTs☆977Updated 8 months ago
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".☆1,298Updated 10 months ago
- Repo of custom instructions that you can use for ChatGPT☆1,267Updated 3 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,397Updated this week
- proof of concept prototype for generating and querying against an ever-expanding knowledge graph with ai☆857Updated 7 months ago
- Tracking Anything in High Quality☆744Updated 11 months ago
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,639Updated 3 weeks ago
- Automatically create prompts and make them fight each other to know which is the best☆559Updated last year
- Ship RAG based LLM web apps in seconds.☆976Updated 9 months ago
- A CLI tool & API over the top 1221 Python libraries.☆547Updated 10 months ago
- 👾 Open source implementation of the ChatGPT Code Interpreter☆3,794Updated 2 weeks ago
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆983Updated 2 months ago
- Create browser automation as if you were teaching a human using GPT-4 Vision.☆564Updated 9 months ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,265Updated this week
- Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"☆1,661Updated 9 months ago
- Official repo for MM-REACT☆935Updated 9 months ago
- Examples of using E2B☆738Updated this week