983632847 / Awesome-Multimodal-Object-TrackingLinks
A continuously updated project to track the latest progress in the field of multi-modal object tracking. This project focuses solely on single-object tracking.
☆929Updated last week
Alternatives and similar repositories for Awesome-Multimodal-Object-Tracking
Users that are interested in Awesome-Multimodal-Object-Tracking are comparing it to the libraries listed below
Sorting:
- R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning.☆67Updated 8 months ago
- SCoralDet and SCoralDet Dataset☆129Updated 4 months ago
- A fast gigapixel processing system☆2,008Updated last year
- Repository of AudioGenie☆235Updated 2 months ago
- (ICLR 2025) The official pytorch implementation of "UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation"☆34Updated this week
- Tutorial for deep learning(AIGC)☆124Updated last month
- AIDoctor training medical GPT model with ChatGPT training pipeline, implemantation of Pretraining, Supervised Finetuning, RLHF(Reward Mod…☆275Updated 10 months ago
- Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos☆306Updated 3 months ago
- 🔥 JMock is a high-performance data generation and simulation component library implemented in Java.☆422Updated 2 months ago
- 🔥 An agile development and testing platform designed to empower small and medium-sized enterprises to build their own R&D systems, strea…☆3,376Updated 3 weeks ago
- 🔥 AngusInfra is a foundational framework for rapidly developing multi-tenant web applications, built on the Enterprise-level development…☆548Updated this week
- [CVPR‘ 2025 ] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration☆249Updated last month
- This is the official repository for C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection☆157Updated 3 months ago
- ☆926Updated 4 months ago
- 🔥 OpenAPIDesigner is an open-source OpenAPI specification design tool that allows developers to design, write, and validate OpenAPI spec…☆568Updated 2 months ago
- [IEEE TASE 2025] The Official Implementation for ''Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Clo…☆108Updated last week
- ☆355Updated last month
- Moxin is a family of fully open-source and reproducible LLMs☆620Updated 6 months ago
- BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence☆239Updated 7 months ago
- 全语言制品仓库,涵盖npm、Maven、PyPi、Docker、Gradle、SBT、Cocoapods、Swift、RPM、Debian、PHP、Go、Pub、Ivy、NuGet、Conda、Cargo、Conan、Yarn、GitLFS、Helm、OHPM等主流工具,涵…☆4,360Updated 3 weeks ago
- Embodied Intelligence in Endovascular Robot Navigation -- 血管介入手术机器人具身导航☆23Updated last week
- csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and App…☆1,488Updated this week
- CausalVLR: A Toolbox and Benchmark for Vision-Language Causal Reasoning (多模态因果推理开源框架)☆1,152Updated 3 months ago
- Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression☆95Updated last year
- PageEyes Agent 是一个轻量级 UI Agent,通过自然语言指令驱动,无需编写脚本既可实现Web、Android平台的UI自动化任务。☆697Updated this week
- A high-performance IM server.☆4,140Updated last week
- Official implementation of ''Pixel-inconsistency modeling for image manipulation localization''☆143Updated 3 months ago
- 生产级iOS网络通信、架构实战 基于 CocoaAsyncSocket 打造的高性能底层通信框架,日均处理万级别消息,真实服务于企业客户!来源于多年IM开发经验总结,经过生产环境验证(已脱敏),完整呈现从单TCP架构到企业级多路复用架构的演进之路。☆750Updated 4 months ago
- EvoVLA: Self-Evolving Vision-Language-Action Model☆224Updated 2 weeks ago
- EDA-Q is a full-stack electronic design automation (EDA) tool for quantum chip design, supporting both superconducting and trapped-ion qu…☆546Updated last week