Doriandarko / Claude-Vision-Object-Detection
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and displays confidence scores.
☆189Updated 2 months ago
Alternatives and similar repositories for Claude-Vision-Object-Detection:
Users that are interested in Claude-Vision-Object-Detection are comparing it to the libraries listed below
- ☆217Updated last month
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆151Updated 3 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆175Updated last month
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆376Updated 2 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆231Updated 4 months ago
- mind map generator☆67Updated last month
- Hallucination Detector is a free and open-source tool that helps you verify the accuracy of your LLM generated content instantly.☆136Updated last week
- podcastfy.ai gradio demo app☆325Updated last month
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usin…☆205Updated 9 months ago
- ☆84Updated last week
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆72Updated 4 months ago
- Claude can perform Web Search | Exa with MCP (Model Context Protocol)☆207Updated last week
- Turn local files into a prompt for an LLM☆160Updated last week
- directory for Awesome MCP Servers☆248Updated last month
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆228Updated 8 months ago
- napkins.dev – from screenshot to app☆84Updated 4 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆205Updated 3 months ago
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,…☆189Updated 2 weeks ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆209Updated 3 weeks ago
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj…☆285Updated 2 months ago
- Turn any developer documentation into a GPT☆83Updated 4 months ago
- Filter X content using LLM API requests, configurable, based on Groq API☆130Updated 5 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆229Updated last month
- ☆178Updated 2 months ago
- The AI assistant for computer control.☆290Updated 4 months ago
- A practical approach to managing multiple AI agents in Cursor through strict file-tree partitioning and domain boundaries.☆331Updated last month
- uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API☆80Updated 2 weeks ago
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI☆404Updated last month
- Use OpenAI's realtime API for a chatting with your documents☆309Updated 3 months ago