roboflow / awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
⭐1,667 · Updated this week
Alternatives and similar repositories for awesome-openai-vision-api-experiments:
Users who are interested in awesome-openai-vision-api-experiments are comparing it to the libraries listed below.
- streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL ⭐1,427 · Updated this week
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥 ⭐1,006 · Updated last year
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] ⭐596 · Updated 10 months ago
- ⭐704 · Updated 10 months ago
- This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural R… ⭐1,850 · Updated last year
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". ⭐1,305 · Updated last year
- Tracking Anything in High Quality ⭐746 · Updated last year
- Turn any computer or edge device into a command center for your computer vision projects. ⭐1,443 · Updated this week
- This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code] ⭐645 · Updated 6 months ago
- GPTeam: An open-source multi-agent simulation ⭐1,659 · Updated this week
- Set-of-Mark Prompting for GPT-4V and LMMs ⭐1,241 · Updated 4 months ago
- webcamGPT - chat with video stream 💬 + 📸 ⭐260 · Updated 10 months ago
- 🥷 Run AI-agents with an API ⭐5,497 · Updated 2 months ago
- Lightweight GPT-4 Vision processing over the Webcam ⭐276 · Updated last year
- proof of concept prototype for generating and querying against an ever-expanding knowledge graph with ai ⭐868 · Updated 9 months ago
- Generate and auto-execute Python scripts in the cli ⭐1,791 · Updated 8 months ago
- MetaSeg: Packaged version of the Segment Anything repository ⭐963 · Updated this week
- Collection of notebook guides created by the Brev.dev team! ⭐1,697 · Updated 3 weeks ago
- AI powered one-click comprehensive docs from transcripts and text. ⭐1,583 · Updated last month
- Learn to build and deploy AI apps. ⭐955 · Updated last year
- Clarity in the current fast-paced mess of Open Source innovation ⭐1,536 · Updated this week
- ⭐7,938 · Updated 7 months ago
- prompt2model - Generate Deployable Models from Natural Language Instructions ⭐1,977 · Updated 2 weeks ago
- A RAG LLM co-pilot for browsing the web, powered by local LLMs ⭐1,452 · Updated last month
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents ⭐5,404 · Updated 3 months ago
- Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning" ⭐1,667 · Updated 11 months ago
- Images to inference with no labeling (use foundation models to train supervised models). ⭐2,058 · Updated last month
- GPT based autonomous agent designed to create personalized newspapers tailored to user preferences. ⭐1,222 · Updated 6 months ago
- ⭐908 · Updated 8 months ago
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection ⭐3,113 · Updated last month