roboflow / awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
★ 1,684 · Updated 7 months ago
Alternatives and similar repositories for awesome-openai-vision-api-experiments
Users who are interested in awesome-openai-vision-api-experiments are comparing it to the repositories listed below.
- Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ★ 2,606 · Updated last week
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥 ★ 1,013 · Updated last year
- ★ 935 · Updated last year
- Clarity in the current fast-paced mess of Open Source innovation ★ 1,589 · Updated 6 months ago
- An open source wearable with camera ★ 611 · Updated last year
- Lightweight GPT-4 Vision processing over the Webcam ★ 285 · Updated last year
- I play with my best friend GPT ★ 293 · Updated last year
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] ★ 627 · Updated last year
- Record voice notes & transcribe, summarize, and get tasks ★ 1,983 · Updated 5 months ago
- ★ 712 · Updated last year
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI. ★ 795 · Updated 2 years ago
- webcamGPT - chat with video stream 💬 + 📸 ★ 265 · Updated last year
- Learn to build and deploy AI apps. ★ 970 · Updated 2 years ago
- Ship RAG based LLM web apps in seconds. ★ 997 · Updated last year
- AI tutor powered by Theory-of-Mind reasoning ★ 839 · Updated last week
- Real-time transcription of audio, integrated with ChatGPT for interactive use. Save, load, and append transcripts for effective context m… ★ 440 · Updated 2 years ago
- A Python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4. ★ 448 · Updated last year
- This repository contains two Python scripts that demonstrate how to create a chatbot using Streamlit, OpenAI GPT-3.5-turbo, and Activeloo… ★ 1,153 · Updated last year
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". ★ 1,309 · Updated last year
- Agent techniques to augment your LLM and push it beyond its limits ★ 1,583 · Updated last year
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs ★ 1,440 · Updated 11 months ago
- Video Search and Streaming Agent 🕵️‍♂️ ★ 484 · Updated last year
- Top ranked OpenAI GPTs ★ 1,013 · Updated 3 weeks ago
- Yes, it's another chat over documents implementation... but this one is entirely local! ★ 1,787 · Updated 4 months ago
- ★ 2,263 · Updated last year
- Tracking Anything in High Quality ★ 752 · Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions ★ 2,010 · Updated 7 months ago
- Autonomous GPT-4 agent platform ★ 1,034 · Updated last year
- This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code] ★ 656 · Updated 2 months ago
- 🤖 Everything you need to create an LLM Agent (tools, prompts, frameworks, and models), all in one place. ★ 1,859 · Updated 2 months ago