Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"
☆20Apr 20, 2023Updated 2 years ago
Alternatives and similar repositories for 3DTRL
Users that are interested in 3DTRL are comparing it to the libraries listed below
Sorting:
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers☆21Aug 2, 2024Updated last year
- Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"☆17Oct 6, 2025Updated 5 months ago
- [WIP] Code for LangToMo☆20Jun 25, 2025Updated 8 months ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆24Jan 9, 2025Updated last year
- This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5☆13Feb 16, 2026Updated 3 weeks ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- Code for Exploit Clues from Views: Self-Supervised and Regularized Learning for Multiview Object Recognition☆12Jun 17, 2020Updated 5 years ago
- ☆14Jun 25, 2022Updated 3 years ago
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos☆28Oct 27, 2025Updated 4 months ago
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆34Jun 17, 2024Updated last year
- ☆31Oct 27, 2022Updated 3 years ago
- ☆18Jan 4, 2024Updated 2 years ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Jan 1, 2024Updated 2 years ago
- ☆18Dec 17, 2022Updated 3 years ago
- [ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning☆70Aug 4, 2024Updated last year
- WACV 2024: "PathLDM: Text conditioned Latent Diffusion Model for Histopathology"☆48Jul 7, 2024Updated last year
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆56Jan 31, 2025Updated last year
- Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability"☆54Oct 10, 2024Updated last year
- Environments for Active Vision Reinforcement Learning☆28Oct 10, 2024Updated last year
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆173Jun 19, 2025Updated 8 months ago
- [WACV 2024] Code for "Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders"☆25Aug 16, 2024Updated last year
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Nov 7, 2023Updated 2 years ago
- [ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy☆227Mar 29, 2025Updated 11 months ago
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆29Jan 13, 2026Updated last month
- This repository contains the implementation for our work "TopoDiffusionNet: A Topology-aware Diffusion Model", accepted to ICLR 2025.☆21Apr 17, 2025Updated 10 months ago
- ☆12Apr 1, 2025Updated 11 months ago
- 使用Qt+librviz+ros设计点云显示界面☆11Jan 5, 2022Updated 4 years ago
- A Tensorflow Implementation of VoxNet.☆11Aug 2, 2018Updated 7 years ago
- ☆11Dec 27, 2022Updated 3 years ago