Doriandarko / Claude-Vision-Object-Detection
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and displays confidence scores.
☆200 Updated 5 months ago
Alternatives and similar repositories for Claude-Vision-Object-Detection:
Users who are interested in Claude-Vision-Object-Detection are comparing it to the repositories listed below.
- Gemini Multimodal Live + WebRTC in a single `app.ts` ☆195 Updated 4 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console ☆156 Updated 6 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you. ☆237 Updated 7 months ago
- ☆368 Updated 3 weeks ago
- Mind map generator ☆71 Updated 4 months ago
- ☆245 Updated 2 months ago
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,… ☆205 Updated 3 months ago
- ☆184 Updated 5 months ago
- YouTube API Server used in https://git.new/scira ☆321 Updated last month
- ☆124 Updated last month
- An Amazon Fresh MCP server ☆62 Updated 5 months ago
- napkins.dev – from screenshot to app ☆85 Updated 7 months ago
- openperplex is an open-source AI search engine ☆165 Updated 8 months ago
- ☆122 Updated last week
- ☆133 Updated 2 months ago
- Turn local files into a prompt for an LLM ☆171 Updated 3 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast. ☆214 Updated 6 months ago
- podcastfy.ai Gradio demo app ☆330 Updated 4 months ago
- 🧍‍♂️ LLM as a manager for approval processes. ☆154 Updated last week
- MCP server for enabling LLM applications to perform deep research via the MCP protocol ☆93 Updated 3 weeks ago
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj… ☆287 Updated 5 months ago
- An implementation of a computer use agent (CUA) using LangGraph ☆139 Updated last month
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usin… ☆210 Updated last year
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fast ☆70 Updated 6 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch… ☆81 Updated 7 months ago
- Use OpenAI's Realtime API for chatting with your documents ☆325 Updated 6 months ago
- Filter X content using LLM API requests, configurable, based on Groq API ☆132 Updated 8 months ago
- Turn any developer documentation into a GPT ☆92 Updated last month
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now) ☆275 Updated 2 months ago
- Extract information from any website by chatting with AI - Fork of Vercel AI Chatbot w/ Firecrawl Integrated ☆116 Updated 3 months ago