Doriandarko / Claude-Vision-Object-Detection
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and displays confidence scores.
☆158Updated last week
Related projects ⓘ
Alternatives and complementary repositories for Claude-Vision-Object-Detection
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI☆323Updated 3 weeks ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆255Updated last week
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆146Updated last month
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆175Updated 2 weeks ago
- Use OpenAI's realtime API for a chatting with your documents☆270Updated last month
- LLM, MultiModal, and Agent tools for ComfyUI☆315Updated 2 months ago
- The fastest way to build robust AI agents☆367Updated this week
- SearchGPT / Perplexity Pages clone, but personalised for you.☆218Updated 2 months ago
- Filter X content using LLM API requests, configurable, based on Groq API☆130Updated 3 months ago
- napkins.dev – from screenshot to app☆82Updated last month
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆278Updated 3 months ago
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usin…☆196Updated 6 months ago
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆220Updated 6 months ago
- The AI assistant for computer control.☆260Updated last month
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆140Updated 2 weeks ago
- podcastfy.ai gradio demo app☆309Updated 2 weeks ago
- Dabbling with ReAct chatbots☆163Updated 3 months ago
- Generate accurate transcripts using Apple's MLX framework☆313Updated 2 weeks ago
- Real-Time Voice Inference Web SDK☆143Updated this week
- the simplest self-building general autonomous agent☆241Updated 3 weeks ago
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆161Updated last week
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …☆307Updated 3 weeks ago
- ☆277Updated 5 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆59Updated last week
- Repo of cursor prompts☆212Updated 2 months ago
- openperplex is an opensource AI search engine☆157Updated 3 months ago
- Turn any developer documentation into a GPT☆73Updated last month
- Full stack advanced chatbot over LlamaIndex.TS documentation with preview feature using Multi-documents-agents, bootstrapped with create-…☆142Updated 8 months ago
- Prompt to ui for fun☆214Updated 4 months ago
- Claude Memory: Long-term memory for Claude☆347Updated this week