Doriandarko / Claude-Vision-Object-Detection
A Python tool that uses the Claude 3.5 Sonnet vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and displays confidence scores.
☆190 Updated 4 months ago
Alternatives and similar repositories for Claude-Vision-Object-Detection:
Users who are interested in Claude-Vision-Object-Detection are comparing it to the repositories listed below.
- ☆274 Updated 2 weeks ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console ☆153 Updated 4 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts` ☆185 Updated 2 months ago
- Turn local files into a prompt for an LLM ☆165 Updated last month
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast. ☆210 Updated 4 months ago
- Official Firecrawl MCP Server - Adds powerful web scraping to Cursor, Claude and any other LLM clients. ☆371 Updated last week
- SearchGPT / Perplexity Pages clone, but personalised for you. ☆235 Updated 6 months ago
- ☆184 Updated 3 months ago
- A Chrome extension for asking questions over websites ☆306 Updated 3 weeks ago
- Mind map generator ☆69 Updated 2 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds! ☆215 Updated 2 months ago
- Claude can perform Web Search | Exa with MCP (Model Context Protocol) ☆247 Updated this week
- podcastfy.ai gradio demo app ☆329 Updated 3 months ago
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,… ☆202 Updated last month
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now) ☆261 Updated 3 weeks ago
- ☆96 Updated last month
- The AI assistant for computer control. ☆299 Updated 5 months ago
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usin… ☆209 Updated 10 months ago
- Dabbling with ReAct chatbots ☆174 Updated 6 months ago
- An open-source implementation of NotebookLM using Deepseek-V3 and PlayHT TTS. ☆243 Updated 2 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ☆518 Updated 3 weeks ago
- openperplex is an open-source AI search engine ☆164 Updated 7 months ago
- New user experiences for interacting with agents ☆429 Updated this week
- Hallucination Detector is a free and open-source tool that helps you verify the accuracy of your LLM-generated content instantly. ☆162 Updated last month
- Assistant for voice-to-blog writing ☆129 Updated last month
- Uses gpt-4o and gpt-4-mini to write books on topics while researching with the Perplexity API ☆86 Updated last month