Doriandarko / Claude-Vision-Object-DetectionLinks
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and displays confidence scores.
β218Updated last year
Alternatives and similar repositories for Claude-Vision-Object-Detection
Users that are interested in Claude-Vision-Object-Detection are comparing it to the libraries listed below
Sorting:
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Consoleβ160Updated last year
- π₯ Generate llms.txt and llms-full.txt files for any website!β485Updated 5 months ago
- β251Updated 10 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.β226Updated last year
- podcastfy.ai gradio demo appβ334Updated last year
- Gemini Multimodal Live + WebRTC in a single `app.ts`β211Updated last month
- mind map generatorβ72Updated 11 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.β245Updated last year
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,β¦β209Updated 10 months ago
- Convert PowerPoint files into semantically rich text using vision language modelsβ107Updated 2 weeks ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!β236Updated 11 months ago
- Use OpenAI's realtime API for a chatting with your documentsβ330Updated last year
- Turn local files into a prompt for an LLMβ177Updated 10 months ago
- openperplex is an opensource AI search engineβ171Updated last year
- napkins.dev β from screenshot to appβ86Updated last year
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard thoughβ562Updated last week
- The AI assistant for computer control.β322Updated last year
- deep seek & o1 auto coders which write python code from a simple description and iteratively improvesit and fix errorsβ95Updated 10 months ago
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This projβ¦β288Updated last year
- β156Updated last month
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usinβ¦β214Updated last year
- An amazon fresh mcp serverβ63Updated last year
- β189Updated last year
- An AI cursor for desktop using Gemini 2.0 Flash (Experimental)β337Updated 9 months ago
- β137Updated 9 months ago
- The Open Deep Research app β generate reports with OSS LLMsβ311Updated this week
- A Chrome extension for asking questions over websitesβ353Updated 9 months ago
- Realtime Voice and Vision wtih Brilliant Labs Frame and Geminiβ66Updated 6 months ago
- Youtube API Server used in https://git.new/sciraβ341Updated 4 months ago
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fastβ77Updated last year