983632847 / Awesome-Multimodal-Object-TrackingLinks
A continuously updated project to track the latest progress in the field of multi-modal object tracking. This project focuses solely on single-object tracking.
☆902Updated last week
Alternatives and similar repositories for Awesome-Multimodal-Object-Tracking
Users that are interested in Awesome-Multimodal-Object-Tracking are comparing it to the libraries listed below
Sorting:
- R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning.☆66Updated 7 months ago
- A fast gigapixel processing system☆2,008Updated last year
- 🔥 A unified system resource management platform designed for administrators, serving as the foundational module for the Angus applicatio…☆1,072Updated last week
- SCoralDet and SCoralDet Dataset☆129Updated 3 months ago
- 🔥 An agile development and testing platform designed to empower small and medium-sized enterprises to build their own R&D systems, strea…☆3,379Updated this week
- 🔥 JMock is a high-performance data generation and simulation component library implemented in Java.☆422Updated last month
- Tutorial for deep learning(AIGC)☆124Updated 3 weeks ago
- 🔥 AngusInfra is a foundational framework for rapidly developing multi-tenant web applications, built on the Enterprise-level development…☆547Updated this week
- Repository of AudioGenie☆231Updated 2 months ago
- 全语言制品仓库,涵盖npm、Maven、PyPi、Docker、Gradle、SBT、Cocoapods、Swift、RPM、Debian、PHP、Go、Pub、Ivy、NuGet、Conda、Cargo、Conan、Yarn、GitLFS、Helm、OHPM等主流工具,涵…☆3,702Updated last week
- (ICLR 2025) The official pytorch implementation of "UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation"☆31Updated 8 months ago
- 🔥 OpenAPIDesigner is an open-source OpenAPI specification design tool that allows developers to design, write, and validate OpenAPI spec…☆568Updated last month
- This is the official repository for C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection☆156Updated 2 months ago
- [IEEE TASE 2025] The Official Implementation for ''Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Clo…☆91Updated 7 months ago
- AIDoctor training medical GPT model with ChatGPT training pipeline, implemantation of Pretraining, Supervised Finetuning, RLHF(Reward Mod…☆275Updated 9 months ago
- Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos☆304Updated 2 months ago
- ☆926Updated 4 months ago
- Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion☆297Updated this week
- [CVPR‘ 2025 ] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration☆248Updated last week
- ☆355Updated 3 weeks ago
- codebase for iccv 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"☆123Updated 4 months ago
- EvoVLA: Self-Evolving Vision-Language-Action Model☆211Updated 2 weeks ago
- [Survey] Towards Efficient Large Language Model Serving: A Survey on System-Aware KV Cache Optimization☆198Updated last week
- Official Repo of "PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Image Quality Assessment via Preference–Resp…☆117Updated last month
- Just having comparing hybrid ResNet50+ViT models with pure ResNet18 CNN on a mixed dataset! Wanted to see how these different architectur…☆22Updated 2 weeks ago
- BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence☆238Updated 6 months ago
- CausalVLR: A Toolbox and Benchmark for Vision-Language Causal Reasoning (多模态 因果推理开源框架)☆1,148Updated 2 months ago
- 🧩 IMAGHarmony 🧩: Controllable image editing with consistent object quantity and layout. A structure-aware framework that ensures high f…☆678Updated 2 months ago
- A lightweight and extensible toolkit for visualizing attention flow in Large Vision-Language Models (LVLMs). It renders token-to-token at…☆130Updated last week
- ☆35Updated 8 months ago