SkalskiP / YOLO-World
Real-Time Open-Vocabulary Object Detection
☆12Updated last year
Alternatives and similar repositories for YOLO-World
Users who are interested in YOLO-World are comparing it to the libraries listed below.
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated last year
- The UnisonAI Multi-Agent Framework built on a custom workflow which allows AI agents to talk together and provides a flexible and extensibl…☆24Updated 2 weeks ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆17Updated 2 months ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆18Updated 6 months ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆16Updated this week
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆15Updated last year
- Simple CogVLM client script☆14Updated 2 years ago
- Auto-Video maker handling many AIs☆11Updated last year
- LiveKit + Next.js AI voice agent interface☆16Updated 9 months ago
- 🧬 [WIP] Lobe Flow - an open-source AI-powered node flow editor☆22Updated 2 years ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated 2 weeks ago
- ☆17Updated last year
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- The Web Metadata Extraction Toolkit is designed to streamline the process of extracting, cleaning, and analyzing metadata from websites. …☆18Updated last year
- ☆16Updated last year
- ☆29Updated 2 years ago
- Probably one of the lightest native RAG + Agent apps out there, experience the power of Agent-powered models and Agent-driven knowledge ba…☆31Updated 6 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆68Updated last year
- Streamlit app presented to the Streamlit LLMs Hackathon September 23☆16Updated last year
- Flask-based web application designed to compare text and image embeddings using the CLIP model.☆22Updated last year
- ☆13Updated last year
- Passively collect images for computer vision datasets on the edge.☆35Updated 2 years ago
- An auto coder that automatically fixes errors and improves the code from a simple user prompt☆36Updated 11 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Updated 2 months ago
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes…☆10Updated 10 months ago
- ☆17Updated last year
- Object segmentation in collaboration with Segment Anything Model and YOLOv8☆25Updated 2 years ago
- ☆17Updated 2 years ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year