Doriandarko / Claude-Vision-Object-DetectionLinks
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and displays confidence scores.
☆220Updated last year
Alternatives and similar repositories for Claude-Vision-Object-Detection
Users that are interested in Claude-Vision-Object-Detection are comparing it to the libraries listed below
Sorting:
- ☆252Updated 10 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆160Updated last year
- podcastfy.ai gradio demo app☆334Updated last year
- 🔥 Generate llms.txt and llms-full.txt files for any website!☆493Updated 6 months ago
- mind map generator☆72Updated last year
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆212Updated 2 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆227Updated last year
- napkins.dev – from screenshot to app☆86Updated last year
- Use OpenAI's realtime API for a chatting with your documents☆330Updated last year
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆237Updated 11 months ago
- An amazon fresh mcp server☆63Updated last year
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,…☆210Updated 11 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆246Updated last year
- ☆158Updated 2 weeks ago
- The AI assistant for computer control.☆325Updated last year
- A Chrome extension for asking questions over websites☆354Updated 10 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆564Updated last month
- Turn local files into a prompt for an LLM☆178Updated 11 months ago
- deep seek & o1 auto coders which write python code from a simple description and iteratively improvesit and fix errors☆95Updated 11 months ago
- ☆191Updated last year
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆318Updated 3 months ago
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usin…☆215Updated last year
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj…☆288Updated last year
- openperplex is an opensource AI search engine☆171Updated last year
- ☆136Updated 10 months ago
- Convert PowerPoint files into semantically rich text using vision language models☆109Updated last month
- The Open Deep Research app – generate reports with OSS LLMs☆313Updated this week
- Youtube API Server used in https://git.new/scira☆343Updated 4 months ago
- Repo of cursor prompts☆243Updated 8 months ago
- Example code and guides for building with Scrapybara☆138Updated 9 months ago