SkalskiP / YOLO-WorldLinks
Real-Time Open-Vocabulary Object Detection
☆12Updated last year
Alternatives and similar repositories for YOLO-World
Users that are interested in YOLO-World are comparing it to the libraries listed below
Sorting:
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated last year
 - Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆15Updated last week
 - A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated last week
 - AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated 3 weeks ago
 - The UnisonAI Multi-Agent Framework (A2A) provides a flexible and extensible environment for creating and coordinating multiple autonomous…☆22Updated last week
 - Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆14Updated last year
 - Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆17Updated 4 months ago
 - My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆11Updated last year
 - LiveKit + Next.js AI voice agent interface☆15Updated 8 months ago
 - ☆17Updated last year
 - Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
 - Streamlit app presented to the Streamlit LLMs Hackathon September 23☆15Updated last year
 - The Web Metadata Extraction Toolkit is designed to streamline the process of extracting, cleaning, and analyzing metadata from websites. …☆18Updated last year
 - Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated last year
 - Roboflow's inference server to analyze video streams. This project extracts insights from video frames at defined intervals and generates…☆13Updated last year
 - ☆21Updated 9 months ago
 - ☆47Updated last year
 - Passively collect images for computer vision datasets on the edge.☆35Updated 2 years ago
 - Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 5 months ago
 - Use this code to access pipeline to Gemini from inside notebookLM☆32Updated last year
 - ☆29Updated last year
 - an auto coder which automatically fixes errors and improves the code from simple user prompt☆36Updated 10 months ago
 - AI Search engine☆12Updated last month
 - Gradio UI to load crewAI configuration from excel xls and generate the python code. The source of the crews is in the xls. It allows for …☆10Updated 2 weeks ago
 - RestAI's Frontend☆20Updated last month
 - ☆17Updated last year
 - Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆17Updated this week
 - 🧬 [WIP] Lobe Flow - an open-source ai powered node flow editor☆22Updated last year
 - Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆22Updated 6 months ago
 - Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year