stephansturges / WALDOLinks
Whereabouts Ascertainment for Low-lying Detectable Objects. The SOTA in FOSS AI for drones!
☆1,612Updated 5 months ago
Alternatives and similar repositories for WALDO
Users that are interested in WALDO are comparing it to the libraries listed below
Sorting:
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms☆1,774Updated this week
- An open-source computer vision framework to build and deploy apps in minutes☆758Updated last year
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"☆6,847Updated 3 months ago
- Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)☆719Updated 2 months ago
- OpenCV+YOLO+LLAVA powered video surveillance system☆763Updated 2 weeks ago
- CoTracker is a model for tracking any point (pixel) on a video.☆4,391Updated 5 months ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,578Updated this week
- Turn any computer or edge device into a command center for your computer vision projects.☆1,747Updated this week
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.☆186Updated 4 months ago
- Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥☆1,681Updated 5 months ago
- RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.☆2,269Updated 2 weeks ago
- Python Computer Vision & Video Analytics Framework With Batteries Included☆651Updated this week
- Machine Assisted Visual Extraction, Reconnaissance & Intelligence for Cosmic Captures☆46Updated last year
- A fast multimodal LLM for real-time voice☆4,030Updated 4 months ago
- Detect objects in drone videos and plot them on a map☆232Updated last year
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 https://x.com/githubprojects/statu…☆561Updated 3 months ago
- Cool experiments at the intersection of Computer Vision and Sports ⚽🏃☆513Updated last year
- [ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy☆2,518Updated 2 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆2,766Updated last month
- ☆1,644Updated 2 months ago
- Learn the basics of robotics through hands-on experience using ROS 2 and Gazebo simulation.☆1,719Updated 3 months ago
- computer vision and sports☆4,316Updated last month
- ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot d…☆1,222Updated last week
- The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computer…☆412Updated last week
- Instructions on how to run LLMs on Raspberry PI☆208Updated 10 months ago
- Images to inference with no labeling (use foundation models to train supervised models).☆2,300Updated last month
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,305Updated last month
- Send data with sound☆556Updated 3 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,125Updated 2 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆116Updated 3 weeks ago