SkalskiP / YOLO-World
Real-Time Open-Vocabulary Object Detection
☆13Updated last year
Alternatives and similar repositories for YOLO-World:
Users that are interested in YOLO-World are comparing it to the libraries listed below
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆14Updated last year
- Passively collect images for computer vision datasets on the edge.☆32Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆11Updated 8 months ago
- ☆14Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆15Updated this week
- The Web Metadata Extraction Toolkit is designed to streamline the process of extracting, cleaning, and analyzing metadata from websites. …☆16Updated 9 months ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆11Updated this week
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆13Updated last year
- Roboflow's inference server to analyze video streams. This project extracts insights from video frames at defined intervals and generates…☆14Updated 11 months ago
- Flask-based web application designed to compare text and image embeddings using the CLIP model.☆22Updated last year
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆43Updated 7 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 6 months ago
- ☆13Updated last year
- 🤖 Sam-assistant is a personal assistant that is designed to understand your documents, search the internet, and in future versions, crea…☆13Updated last year
- EdgeSAM model for use with Autodistill.☆26Updated 10 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆15Updated 3 months ago
- A minimal Model Context Protocol 🖥️ server/client🧑💻with Azure OpenAI and 🌐 web browser control via Playwright.☆15Updated 2 weeks ago
- ☆10Updated 8 months ago
- ☆16Updated last year
- AI Search engine☆12Updated 2 months ago
- ☆16Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated this week
- 6D Rotation Representation for Unconstrained Head Pose Estimation☆13Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆17Updated 8 months ago
- Simple CogVLM client script☆14Updated last year
- ☆12Updated last year
- ☆12Updated 11 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆63Updated 8 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆21Updated last month
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injection…☆11Updated last month