[CVPR2025] Official implementation of RAM
☆27Nov 4, 2025Updated 3 months ago
Alternatives and similar repositories for RAM
Users that are interested in RAM are comparing it to the libraries listed below
Sorting:
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆16Jul 9, 2024Updated last year
- [ICLR 2026 Oral] Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning.☆30Feb 10, 2026Updated 2 weeks ago
- ☆16Jun 30, 2025Updated 8 months ago
- Code for "ACG: Action Coherence Guidance for Flow-based Vision-Language-Action Models" (ICRA 2026)☆60Feb 21, 2026Updated last week
- ☆12Mar 28, 2025Updated 11 months ago
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆15Feb 26, 2025Updated last year
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆27Jul 18, 2025Updated 7 months ago
- [CVPR 2025] SAM-I2V☆35Jan 2, 2026Updated last month
- LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving☆22Sep 17, 2025Updated 5 months ago
- This is the source code to paper “DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation”.☆30Aug 13, 2025Updated 6 months ago
- Fast LiDAR Data Generation with Rectified Flows (ICRA 2025)☆23Aug 4, 2025Updated 6 months ago
- [IEEE RA-L 2025] The official repository for Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Reco…☆58Jun 2, 2025Updated 8 months ago
- [CoRL 2025] GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation☆63Sep 16, 2025Updated 5 months ago
- Official implementation of "Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation".☆32Nov 13, 2025Updated 3 months ago
- [CVPR 2024] Action-slot: Visual Action-centric Representations for Atomic Activity Recognition in Traffic Scenes☆24Apr 28, 2025Updated 10 months ago
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆30Sep 20, 2025Updated 5 months ago
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory☆62Jan 13, 2026Updated last month
- [AAAI 2025] OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving☆22Dec 24, 2024Updated last year
- code for CoRL2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"☆45Nov 30, 2025Updated 3 months ago
- ROS driver for RICOH THETA V/Z1☆23Sep 3, 2023Updated 2 years ago
- Official code of SaliencyI2PLoc☆31Feb 20, 2025Updated last year
- [CVPR 2025 Highlight] Unlocking Generalization Power in LiDAR Point Cloud Registration☆62Jun 14, 2025Updated 8 months ago
- [ICCV 2023] The official PyTorch code for Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation☆90Sep 7, 2023Updated 2 years ago
- (unofficial)(2025)RANSAC Revisited: An Improved Algorithm for Robust Subspace Recovery under Adversarial and Noisy Corruptions☆30Aug 12, 2025Updated 6 months ago
- Simulation-based LiDAR Dataset for Long-Term Place Recognition Under Extreme Structural Changes☆31Nov 21, 2025Updated 3 months ago
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆29Jul 23, 2024Updated last year
- Official code for "BoMD: Bag of Multi-label Descriptors for Noisy Chest X-ray Classification"☆27Apr 11, 2024Updated last year
- SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis☆31Apr 15, 2025Updated 10 months ago
- Official code for "To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition" CVPR IMW 2025☆38Oct 4, 2025Updated 4 months ago
- Pushing 3D Vision Foundation Models Towards Globally Consistent Online Reconstruction☆52Dec 16, 2025Updated 2 months ago
- Minimal code for running the Dust3r model with PyTorch☆38Mar 16, 2025Updated 11 months ago
- Official repository for the AAAI 2025 paper "Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition".☆34Oct 5, 2025Updated 4 months ago
- ☆64Sep 8, 2025Updated 5 months ago
- Segment This Thing is an efficient image segmentation models that uses a biologically-inspired foveated tokenization to reduce inference …☆55Jun 16, 2025Updated 8 months ago
- DICNet: Deep Instance-Level Contrastive Network for Double Incomplete Multi-View Multi-Label Classification☆27Jan 14, 2025Updated last year
- Official PyTorch code for the CVPR 2024 paper 'Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognitio…☆37May 28, 2025Updated 9 months ago
- [ICME 2023, Oral] HybridPoint: Point cloud registration based on hybrid point sampling and matching☆29Mar 14, 2024Updated last year
- [CVPR2025] LightLoc: Learning Outdoor LiDAR Localization at Light Speed☆81Jul 23, 2025Updated 7 months ago
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))☆71Oct 24, 2023Updated 2 years ago