Doriandarko / Claude-Vision-Object-DetectionLinks
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and displays confidence scores.
☆204Updated 7 months ago
Alternatives and similar repositories for Claude-Vision-Object-Detection
Users that are interested in Claude-Vision-Object-Detection are comparing it to the libraries listed below
Sorting:
- ☆421Updated this week
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆206Updated 6 months ago
- ☆246Updated 4 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆158Updated 8 months ago
- mind map generator☆72Updated 6 months ago
- ☆138Updated last month
- SearchGPT / Perplexity Pages clone, but personalised for you.☆242Updated 9 months ago
- napkins.dev – from screenshot to app☆86Updated 8 months ago
- openperplex is an opensource AI search engine☆166Updated 10 months ago
- ☆154Updated last week
- deep seek & o1 auto coders which write python code from a simple description and iteratively improvesit and fix errors☆98Updated 5 months ago
- ☆182Updated 6 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆290Updated this week
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,…☆206Updated 5 months ago
- An implementation of a computer use agent (CUA) using LangGraph☆158Updated 2 months ago
- Youtube API Server used in https://git.new/scira☆328Updated 3 months ago
- podcastfy.ai gradio demo app☆334Updated 6 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆218Updated 7 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆226Updated 5 months ago
- ☆451Updated this week
- Assistant for voice-to-blog writing☆137Updated 4 months ago
- Extract information from any website by chatting with AI - Fork of Vercel AI Chatbot w/ Firecrawl Integrated☆122Updated 4 months ago
- Local Groq Desktop chat app with MCP support☆284Updated last week
- Convert PowerPoint files into semantically rich text using vision language models☆99Updated 3 months ago
- 🧍♂️LLM as a manager for approval processes.☆200Updated 2 months ago
- uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API☆92Updated 5 months ago
- Example code and guides for building with Scrapybara☆130Updated 3 months ago
- Run coding agents in a secure sandbox. A simple SDK for safely running Codex and Claude Code in your app or workflow. 🖖☆213Updated this week
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆69Updated this week
- Find the best OSS coding LLMs by watching them battle☆100Updated 6 months ago