A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and displays confidence scores.
☆221Nov 3, 2024Updated last year
Alternatives and similar repositories for Claude-Vision-Object-Detection
Users that are interested in Claude-Vision-Object-Detection are comparing it to the libraries listed below
Sorting:
- ☆10Feb 14, 2025Updated last year
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆568Nov 20, 2025Updated 3 months ago
- The very first artist assistant☆23Jul 7, 2023Updated 2 years ago
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.…☆703Nov 5, 2024Updated last year
- React app for inspecting, building and debugging with the Realtime API☆40Oct 7, 2024Updated last year
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.☆858Nov 10, 2025Updated 3 months ago
- Code for Columbia University COMS 3997 – LLM Ethics and Foundations☆14Jan 7, 2025Updated last year
- A MCP server that provides web search capabilities using the Claude API.☆50May 10, 2025Updated 9 months ago
- [NeurIPS 24] PromptFix: You Prompt and We Fix the Photo☆891Oct 4, 2024Updated last year
- 💅 A GitHub Actions workflow template for fine-tuning your own Flux model using the Replicate's API☆30Sep 24, 2024Updated last year
- ☆873Mar 18, 2025Updated 11 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆236Oct 24, 2024Updated last year
- Generate descriptions from product images in multiple languages with AI☆326Jan 20, 2025Updated last year
- ☆249Oct 16, 2024Updated last year
- ☆40May 14, 2025Updated 9 months ago
- uses all reasoning models in parallel and synthesizes an answer with o1. also has multi-chat where you can chat with any of them☆40Jan 23, 2025Updated last year
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,578Jan 20, 2025Updated last year
- Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist …☆11,168Dec 12, 2024Updated last year
- An AI-powered Snake game where Claude, an advanced language model, controls the serpent in real-time, showcasing intelligent decision-mak…☆47Nov 6, 2024Updated last year
- ☆3,494Nov 15, 2024Updated last year
- Jason Meridth's blog☆13Updated this week
- NextJS meets Gatsby source plugins as a graphql server☆10Jan 6, 2019Updated 7 years ago
- ☆10Jul 17, 2023Updated 2 years ago
- Multi-person podcast audio to videocast☆10Sep 28, 2024Updated last year
- upload a manim script and generate an animation☆11Mar 10, 2024Updated last year
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- A lightwight Framework for the Respiratory Sound Classification☆11Feb 12, 2025Updated last year
- ☆12Nov 21, 2024Updated last year
- Repository for tw.org site☆14Feb 11, 2026Updated 3 weeks ago
- A simple demo showing how to use the Ideogram inpainting model on Replicate using Node.js.☆14Oct 24, 2024Updated last year
- Patient Intake Form Extraction using llm☆15May 29, 2025Updated 9 months ago
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonom…☆4,208Mar 2, 2026Updated last week
- the simplest self-building coding agent☆1,056Oct 19, 2024Updated last year
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,987Dec 8, 2025Updated 3 months ago
- Model Context Protocol Servers (Browserbase Version)☆49Nov 26, 2024Updated last year
- gradio WebUI for AdvancedLivePortrait☆530Mar 13, 2025Updated 11 months ago
- Open source Claude Artifacts – built with Llama 3.1 405B☆6,886Updated this week
- Created with StackBlitz ⚡️☆28Nov 18, 2024Updated last year
- A Mistral chatbot app built with Expo☆11Nov 16, 2024Updated last year