Xuan-World / Mamba-YOLO-WorldView external linksLinks
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
☆95Mar 12, 2025Updated 11 months ago
Alternatives and similar repositories for Mamba-YOLO-World
Users that are interested in Mamba-YOLO-World are comparing it to the libraries listed below
Sorting:
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆176Jan 17, 2025Updated last year
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"☆90Dec 23, 2025Updated last month
- Implementation of YOLO and IOU tracker in C++☆18Dec 20, 2021Updated 4 years ago
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆556Feb 4, 2026Updated last week
- OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆400Mar 12, 2025Updated 11 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 3 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆49Jan 30, 2026Updated 2 weeks ago
- ☆12Nov 13, 2024Updated last year
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"☆84Jan 2, 2026Updated last month
- Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning☆45Jul 2, 2025Updated 7 months ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆32Jun 3, 2025Updated 8 months ago
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆123Apr 26, 2024Updated last year
- ☆13Jul 30, 2024Updated last year
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆151Jan 10, 2026Updated last month
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆62Apr 7, 2024Updated last year
- ☆35Nov 25, 2025Updated 2 months ago
- ☆15Dec 11, 2024Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- YOLO-World-ONNX is a Python package for running inference on YOLO-WORLD Open-vocabulary-object detection model using ONNX models. It prov…☆15Feb 6, 2026Updated last week
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)☆29Oct 26, 2025Updated 3 months ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer☆123Jun 27, 2025Updated 7 months ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 9 months ago
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆215Apr 3, 2025Updated 10 months ago
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆47Jul 17, 2025Updated 6 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆35Sep 9, 2024Updated last year
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection☆6,208Feb 26, 2025Updated 11 months ago
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference☆56Nov 20, 2024Updated last year
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Mar 4, 2025Updated 11 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆131Mar 22, 2025Updated 10 months ago
- Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"☆19Oct 3, 2024Updated last year
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated 10 months ago
- ☆21Aug 25, 2024Updated last year
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models☆21Jan 29, 2025Updated last year
- ☆41Jan 10, 2025Updated last year
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆342Nov 6, 2025Updated 3 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆23Mar 13, 2025Updated 11 months ago