Doriandarko / Claude-Vision-Object-DetectionLinks
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and displays confidence scores.
☆212Updated 10 months ago
Alternatives and similar repositories for Claude-Vision-Object-Detection
Users that are interested in Claude-Vision-Object-Detection are comparing it to the libraries listed below
Sorting:
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆160Updated 11 months ago
- ☆249Updated 7 months ago
- ☆458Updated 2 months ago
- mind map generator☆71Updated 8 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆209Updated 8 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆245Updated last year
- podcastfy.ai gradio demo app☆334Updated 9 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆224Updated 10 months ago
- Use OpenAI's realtime API for a chatting with your documents☆330Updated 11 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆233Updated 8 months ago
- napkins.dev – from screenshot to app☆86Updated 11 months ago
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,…☆209Updated 7 months ago
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj…☆288Updated 9 months ago
- openperplex is an opensource AI search engine☆170Updated last year
- ☆148Updated 3 months ago
- The Open Deep Research app – generate reports with OSS LLMs☆295Updated last month
- Turn local files into a prompt for an LLM☆176Updated 7 months ago
- deep seek & o1 auto coders which write python code from a simple description and iteratively improvesit and fix errors☆96Updated 7 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆283Updated 8 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆556Updated 3 months ago
- A Chrome extension for asking questions over websites☆344Updated 7 months ago
- The AI assistant for computer control.☆319Updated 11 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆304Updated last month
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆111Updated 10 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆496Updated 3 weeks ago
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usin…☆213Updated last year
- ☆136Updated 7 months ago
- ☆182Updated last week
- An amazon fresh mcp server☆64Updated 9 months ago
- Get started with native image generation and editing using Gemini 2.0 and Next.js☆489Updated last week