SkalskiP / YOLO-World
Real-Time Open-Vocabulary Object Detection
☆13 · Updated last year
Alternatives and similar repositories for YOLO-World
Users who are interested in YOLO-World are comparing it to the libraries listed below.
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆63 · Updated 9 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support. ☆14 · Updated last year
- Passively collect images for computer vision datasets on the edge. ☆33 · Updated last year
- Simple CogVLM client script ☆14 · Updated last year
- EdgeSAM model for use with Autodistill. ☆26 · Updated 11 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆11 · Updated 9 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel… ☆13 · Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆35 · Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆22 · Updated 7 months ago
- Streamlit app presented to the Streamlit LLMs Hackathon September 23 ☆16 · Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆19 · Updated 7 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ☆46 · Updated 8 months ago
- This repo is a packaged version of the YOLOv9 model. ☆89 · Updated last month
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆123 · Updated 9 months ago
- Nassimos07 / Moving-Stopped-Persons-Real-Time-Detection-using-YOLOv8-or-YOLOv10-Roboflow_Supervision ☆9 · Updated last month
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more! ☆12 · Updated this week
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆13 · Updated 3 weeks ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks ☆16 · Updated 6 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆16 · Updated 4 months ago
- Okra, your all in one personal AI assistant ☆14 · Updated 11 months ago
- Roboflow's inference server to analyze video streams. This project extracts insights from video frames at defined intervals and generates… ☆14 · Updated 11 months ago
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2). This project provides a video processing tool that utilizes… ☆10 · Updated 2 months ago
- Seamless Voice Interactions with LLMs ☆12 · Updated last year