The official repo of our work "Pensieve: Retrospect-then-Compare mitigates Visual Hallucination"
☆15May 4, 2024Updated last year
Alternatives and similar repositories for Pensieve
Users that are interested in Pensieve are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ✨ [CVPR 2023] NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation☆47Jun 3, 2023Updated 2 years ago
- 💫 [CVPR 2024] LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis☆224Jun 18, 2024Updated last year
- [NeurIPS 2024] Efficient Large Multi-modal Models via Visual Context Compression☆67Feb 19, 2025Updated last year
- HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation☆15Sep 13, 2024Updated last year
- ☆11May 28, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Attention-Enhanced Cross-modal Localization Between Spherical Images and Point Clouds (IEEE Sensors Journal)☆12May 1, 2023Updated 2 years ago
- 😊 [NeurIPS 2024] GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields☆23Mar 5, 2025Updated last year
- [3DV 2025] VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition☆18Mar 18, 2025Updated last year
- [CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension☆63Apr 8, 2024Updated 2 years ago
- ☆14Dec 11, 2024Updated last year
- Multi-Sensor Place Recognition with Visual and Text Semantics☆21May 27, 2025Updated 10 months ago
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Nov 4, 2024Updated last year
- [CVPR 2024] GeoAuxNet: Torwards Universal 3D Representation Learning for Multi-sensor Point Clouds☆18Mar 29, 2024Updated 2 years ago
- Event-based dynamic neural radiance fields for generating eventstreams from novel viewpoints and time windows. WACV 2024.☆19Aug 8, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICLR 2025] Think Then React: Towards Unconstrained Action-to-Reaction Motion Generation☆20Mar 21, 2025Updated last year
- Official implementation of "Stereo Depth from Events Cameras: Concentrate and Focus on the Future" (CVPR 2022)☆52Jan 12, 2023Updated 3 years ago
- Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…☆10May 5, 2022Updated 3 years ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs☆13Dec 28, 2024Updated last year
- Source code for WWW 2019 paper "Efficient Path Prediction for Semi-Supervised and Weakly Supervised Hierarchical Text Classification"☆14May 3, 2019Updated 6 years ago
- [AAAI 2024 (Oral)] Safety-MuJoCo Environments.☆11Jun 4, 2024Updated last year
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆22Jan 11, 2026Updated 3 months ago
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆11Jan 15, 2020Updated 6 years ago
- Codes for Paper <LC2: LiDAR-Camera Loop Constraints From Cross-Modal Place Recognition>.☆38Apr 8, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for "Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos"☆28Oct 25, 2021Updated 4 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- [ECCV 2022 Oral] Perspective Transformer on 3D Lane Detection☆501Jul 2, 2025Updated 9 months ago
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆36Aug 8, 2024Updated last year
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆16Jul 20, 2023Updated 2 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated last month
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- [NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering☆105Apr 6, 2025Updated last year
- ☆26Dec 11, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆22Jun 11, 2024Updated last year
- [CVPR 2023 Hightlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆33Aug 30, 2023Updated 2 years ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…☆14Aug 15, 2023Updated 2 years ago
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆38Aug 20, 2024Updated last year
- ☆11May 7, 2022Updated 3 years ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆406Aug 24, 2024Updated last year
- CNN Based Image Retrieval. SoTu☆12Jan 11, 2024Updated 2 years ago