roboflow / trackersLinks

A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms

☆1,774

Alternatives and similar repositories for trackers

Users that are interested in trackers are comparing it to the libraries listed below

Sorting:

roboflow / rf-detr
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
☆2,269Updated 2 weeks ago
yangchris11 / samurai
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
☆6,847Updated 3 months ago
THU-MIG / yoloe
YOLOE: Real-Time Seeing Anything
☆1,364Updated last month
apple / ml-fastvlm
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
☆4,233Updated last month
NVlabs / describe-anything
Implementation for Describe Anything: Detailed Localized Image and Video Captioning
☆1,179Updated last month
manycore-research / SpatialLM
SpatialLM: Training Large Language Models for Structured Indoor Modeling
☆3,395Updated 2 weeks ago
deltacv / PaperVision
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quic…
☆369Updated last month
SkalskiP / top-cvpr-2025-papers
About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]
☆609Updated last week
roboflow / maestro
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
☆2,578Updated this week
roboflow / sports
computer vision and sports
☆4,316Updated last month
fkryan / gazelle
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆719Updated 2 months ago
prs-eth / thera
Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields
☆807Updated last month
ngxson / smolvlm-realtime-webcam
Real-time webcam demo with SmolVLM and llama.cpp server
☆3,969Updated last month
siyuanliii / masa
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
☆1,305Updated last month
SkalskiP / vlms-zero-to-hero
This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.
☆1,092Updated 5 months ago
IDEA-Research / DINO-X-API
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
☆1,096Updated last week
microsoft / Magma
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
☆1,725Updated 3 weeks ago
attentionmech / mav
Model Activity Visualiser
☆506Updated 2 months ago
Peterande / D-FINE
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
☆2,464Updated 2 months ago
yformer / EfficientTAM
Efficient Track Anything
☆571Updated 5 months ago
abhiemj / manim-mcp-server
☆403Updated last month
kyutai-labs / hibiki
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…
☆1,125Updated 2 months ago
facebookresearch / perception_models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆1,294Updated 3 weeks ago
TIGER-AI-Lab / TheoremExplainAgent
Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025]
☆1,309Updated this week
IDEA-Research / Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
☆2,347Updated last month
sofi444 / realtime-transcription-fastrtc
Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗
☆661Updated last week
NVIDIA-AI-Blueprints / pdf-to-podcast
Transform PDFs into AI podcasts for engaging on-the-go audio content.
☆671Updated 3 weeks ago
stephansturges / WALDO
Whereabouts Ascertainment for Low-lying Detectable Objects. The SOTA in FOSS AI for drones!
☆1,612Updated 5 months ago
NanoNets / docext
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
☆1,220Updated last week
huggingface / nanoVLM
The simplest, fastest repository for training/finetuning small-sized VLMs.
☆3,558Updated this week