Trans4Map: Revisiting Holistic Top-down Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers
☆17Oct 14, 2022Updated 3 years ago
Alternatives and similar repositories for Trans4Map
Users that are interested in Trans4Map are comparing it to the libraries listed below
Sorting:
- ☆325Nov 3, 2023Updated 2 years ago
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆24Jan 21, 2026Updated last month
- ☆11Feb 7, 2025Updated last year
- PyRobot: An Open Source Robotics Research Platform☆11Dec 17, 2021Updated 4 years ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated 10 months ago
- Low-Computation Egocentric Barcode Detector for the Blind☆10Jun 9, 2017Updated 8 years ago
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆15Jul 15, 2025Updated 7 months ago
- Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving☆16Feb 10, 2025Updated last year
- Modular and simple vision language navigation framework☆12Aug 16, 2021Updated 4 years ago
- The Oxford RobotCar Facade dataset.☆11Jun 4, 2022Updated 3 years ago
- The official implementation of paper: Estimating Egocentric 3D Human Pose in Global Space.☆12Sep 23, 2023Updated 2 years ago
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- [CVPR2025] The code for "Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction."☆21Oct 19, 2025Updated 4 months ago
- Tracking Multiple Deformable Objects in Egocentric Videos (CVPR 2023)☆13Apr 10, 2023Updated 2 years ago
- ☆10Nov 16, 2023Updated 2 years ago
- ☆13Jul 6, 2022Updated 3 years ago
- ☆14Jul 11, 2025Updated 7 months ago
- ☆12Nov 28, 2022Updated 3 years ago
- ☆11Mar 25, 2021Updated 4 years ago
- OpenPCDet with spconv package already included for one-step installation. Uses spconv & voxel CUDA ops from mmdetection3d repository.☆15Nov 2, 2021Updated 4 years ago
- A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation☆13Dec 22, 2022Updated 3 years ago
- 🌐 A Roadmap for 3D Scene Understanding in the Wild☆23Dec 19, 2025Updated 2 months ago
- CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition☆12Apr 21, 2020Updated 5 years ago
- The official implementation of "Label-efficient Semantic Scene Completion with Scribble Annotations" (IJCAI 2024)☆14Jul 27, 2024Updated last year
- Implements the loss used in A. Furnari, S. Battiato, G. M. Farinella (2018). Leveraging Uncertainty to Rethink Loss Functions and Evaluat…☆11May 22, 2019Updated 6 years ago
- Interface to stable-baselines3 APIs for training RL policies on gym-registered environments☆12Jan 24, 2024Updated 2 years ago
- Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)☆14Jan 8, 2024Updated 2 years ago
- ☆15Jul 9, 2021Updated 4 years ago
- An official implementation for "OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera"☆29Nov 6, 2025Updated 4 months ago
- Official Repository for the Paper - ViT BEVSeg: A Hierarchical Transformer Network for Monocular Birds-Eye-View Segmentation☆14May 30, 2022Updated 3 years ago
- ☆15Jul 23, 2025Updated 7 months ago
- Application of object detection methods state of arts, including MobileNetSSD, Mask-RCNN,YOLO series☆14Feb 4, 2022Updated 4 years ago
- Video Benchmark Suite: Rapid Evaluation of Video Foundation Models☆15Jan 10, 2025Updated last year
- This is an sample project of EasyAR SDK 4.0. This sample shows how to use denseSpatialMap to reconstruct the environment with a mobile ph…☆14Oct 31, 2020Updated 5 years ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆20Sep 12, 2025Updated 5 months ago
- ☆16Jan 1, 2023Updated 3 years ago
- NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.☆17Jan 26, 2024Updated 2 years ago
- Python Implementation for KITTI Scan Unfolding☆16Jun 20, 2025Updated 8 months ago
- ☆19Sep 25, 2021Updated 4 years ago