☆10Apr 7, 2025Updated 10 months ago
Alternatives and similar repositories for TrackingMeetsLMM
Users who are interested in TrackingMeetsLMM are comparing it to the libraries listed below
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆41Oct 19, 2025Updated 4 months ago
- [NeurIPS 2024] Repository for the paper "OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking".☆27Nov 9, 2024Updated last year
- ☆25Dec 23, 2024Updated last year
- [ICRA 2025] LaMOT: Language-Guided Multi-Object Tracking☆29Feb 10, 2025Updated last year
- TrackGPT: Track What You Need in Videos via Text Prompts☆25May 16, 2023Updated 2 years ago
- Vision-Language based Visual Object Tracking☆27Oct 10, 2025Updated 4 months ago
- ☆14Jul 15, 2024Updated last year
- 🏄 [ICLR 2025] OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer☆88Aug 4, 2025Updated 6 months ago
- ☆20Mar 2, 2025Updated last year
- LEO: A powerful Hybrid Multimodal LLM☆19Jan 18, 2025Updated last year
- ☆19Jul 25, 2024Updated last year
- Multi-Granularity Language-Guided Multi-Object Tracking☆24Nov 3, 2025Updated 4 months ago
- Segment Anything with Deictic Prompting☆27May 13, 2025Updated 9 months ago
- OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]☆112Oct 14, 2024Updated last year
- Segment This Thing is an efficient image segmentation model that uses a biologically-inspired foveated tokenization to reduce inference …☆55Jun 16, 2025Updated 8 months ago
- 🚀【AAAI 2025】Cross-View Referring Multi-Object Tracking☆74Feb 4, 2026Updated 3 weeks ago
- Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking☆33Mar 14, 2025Updated 11 months ago
- StreamingFlow: Streaming Occupancy Forecasting with Asynchronous Multi-modal Data Streams via Neural Ordinary Differential Equation☆34Jul 1, 2024Updated last year
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- Visual Spatial Tuning☆176Feb 19, 2026Updated last week
- Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown☆41Feb 22, 2026Updated last week
- Official implementation of "Referring Video Object Segmentation via Language Aligned Track Selection".☆40Jun 2, 2025Updated 9 months ago
- [ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM☆86Oct 25, 2024Updated last year
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"☆45Jul 1, 2025Updated 8 months ago
- [CVPR2024] Towards Generalizable Multi-Object Tracking☆33May 3, 2024Updated last year
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- Advanced tutorial on single-cell sequencing☆11Apr 12, 2021Updated 4 years ago
- ☆10Jul 30, 2024Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- ☆96Dec 17, 2024Updated last year
- Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving☆62Jul 25, 2025Updated 7 months ago
- Fully Sparse Transformer 3D Detector for LiDAR Point Cloud☆37Dec 8, 2023Updated 2 years ago
- [AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video…☆91Dec 23, 2024Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- ☆11Jun 13, 2025Updated 8 months ago
- The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo…☆13Jun 16, 2025Updated 8 months ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆14Jul 31, 2025Updated 7 months ago
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs☆68Jul 1, 2025Updated 8 months ago
- ☆11Jan 18, 2025Updated last year