roboflow / awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
☆1,678 · Updated 2 months ago
Alternatives and similar repositories for awesome-openai-vision-api-experiments:
Users that are interested in awesome-openai-vision-api-experiments are comparing it to the libraries listed below
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,513 · Updated this week
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥 ☆1,011 · Updated last year
- 👁️ + 💬 + 🧠 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] ☆606 · Updated last year
- Images to inference with no labeling (use foundation models to train supervised models). ☆2,178 · Updated this week
- webcamGPT - chat with video stream 💬 + 📸 ☆265 · Updated last year
- Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and video, up to 5x faster than… ☆1,101 · Updated 2 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code] ☆646 · Updated 8 months ago
- I play with my best friend GPT ☆295 · Updated last year
- Set-of-Mark Prompting for GPT-4V and LMMs ☆1,324 · Updated 7 months ago
- Tracking Anything in High Quality ☆748 · Updated last year
- MetaSeg: Packaged version of the Segment Anything repository ☆972 · Updated last week
- A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks. ☆1,632 · Updated 6 months ago
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". ☆1,310 · Updated last year
- ☆707 · Updated last year
- Turn any computer or edge device into a command center for your computer vision projects. ☆1,578 · Updated this week
- Agent techniques to augment your LLM and push it beyond its limits ☆1,571 · Updated 10 months ago
- Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI… ☆6,660 · Updated 9 months ago
- Real-time transcription of audio, integrated with ChatGPT for interactive use. Save, load, and append transcripts for effective context m… ☆439 · Updated last year
- ☆925 · Updated 10 months ago
- prompt2model - Generate Deployable Models from Natural Language Instructions ☆1,983 · Updated 2 months ago
- proof of concept prototype for generating and querying against an ever-expanding knowledge graph with ai ☆884 · Updated 11 months ago
- Video Search and Streaming Agent 🕵️ ☆464 · Updated last year
- AI tutor powered by Theory-of-Mind reasoning ☆799 · Updated this week
- Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM) ☆3,371 · Updated last month
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be… ☆2,887 · Updated this week
- A school for camelids ☆1,208 · Updated last year
- ☆6,605 · Updated last month
- Ship RAG based LLM web apps in seconds. ☆987 · Updated last year
- An open source wearable with camera ☆598 · Updated 10 months ago
- Chat with your PDFs with AI ☆1,219 · Updated 2 months ago