roboflow / awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
☆1,673Updated last month
Alternatives and similar repositories for awesome-openai-vision-api-experiments:
Users that are interested in awesome-openai-vision-api-experiments are comparing it to the libraries listed below
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,373Updated this week
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥☆1,011Updated last year
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".☆1,305Updated last year
- Turn any computer or edge device into a command center for your computer vision projects.☆1,515Updated this week
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,477Updated 3 weeks ago
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]☆601Updated 11 months ago
- Clarity in the current fast-paced mess of Open Source innovation☆1,540Updated last month
- Set-of-Mark Prompting for GPT-4V and LMMs☆1,282Updated 6 months ago
- Train a chatbot on an entire YouTube channel using OpenAI & Pinecone.☆367Updated last year
- ☆706Updated 11 months ago
- proof of concept prototype for generating and querying against an ever-expanding knowledge graph with ai☆881Updated 10 months ago
- Generate and auto-execute Python scripts in the cli☆1,792Updated 9 months ago
- webcamGPT - chat with video stream 💬 + 📸☆263Updated 11 months ago
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,607Updated 5 months ago
- Images to inference with no labeling (use foundation models to train supervised models).☆2,128Updated 2 months ago
- Ship RAG based LLM web apps in seconds.☆982Updated last year
- Autonomous GPT-4 agent platform☆1,019Updated 11 months ago
- ☆278Updated 6 months ago
- MetaSeg: Packaged version of the Segment Anything repository☆967Updated this week
- llama.cpp with BakLLaVA model describes what does it see☆381Updated last year
- computer vision and sports☆2,799Updated 6 months ago
- ☆582Updated last year
- A school for camelids☆1,210Updated last year
- ☆2,580Updated last month
- ☆618Updated last year
- A CLI tool & API over the top 1221 Python libraries.☆548Updated last year
- 4M: Massively Multimodal Masked Modeling☆1,686Updated this week
- Theory-of-mind powered AI tutor using o1 style reasoning☆784Updated this week
- Real-time transcription of audio, integrated with ChatGPT for interactive use. Save, load, and append transcripts for effective context m…☆440Updated last year
- ☆2,852Updated 5 months ago