Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"
☆20Apr 20, 2023Updated 3 years ago
Alternatives and similar repositories for 3DTRL
Users that are interested in 3DTRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers☆21Aug 2, 2024Updated last year
- This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5☆13Apr 29, 2026Updated last week
- [WIP] Code for LangToMo☆21Mar 19, 2026Updated last month
- Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"☆17Oct 6, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is the offical repository of LLAVIDAL☆24Oct 4, 2025Updated 7 months ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆25Jan 9, 2025Updated last year
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos☆28Oct 27, 2025Updated 6 months ago
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆36Jun 17, 2024Updated last year
- [Main Conference @ EACL'26] [Workshop @ NeurIPS'24] 🎞️ LVNet.☆43Feb 10, 2026Updated 2 months ago
- Code for Exploit Clues from Views: Self-Supervised and Regularized Learning for Multiview Object Recognition☆12Jun 17, 2020Updated 5 years ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Jan 1, 2024Updated 2 years ago
- ☆18Dec 17, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆31Oct 27, 2022Updated 3 years ago
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆56Jan 31, 2025Updated last year
- [WACV 2024] Code for "Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders"☆25Aug 16, 2024Updated last year
- Tools for Toyota Smarthome datasets☆14Nov 16, 2022Updated 3 years ago
- ☆18Jan 4, 2024Updated 2 years ago
- [NeurIPS 2023] Self-supervised Object-Centric Learning for Videos☆32Nov 28, 2024Updated last year
- ☆20Mar 10, 2025Updated last year
- Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise☆42Oct 7, 2025Updated 7 months ago
- Pytorch I3D implmentation on Toyota Smarthome Dataset☆17Apr 23, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆175Jun 19, 2025Updated 10 months ago
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors☆31Jun 2, 2024Updated last year
- [ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy☆229Mar 29, 2025Updated last year
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)☆20Aug 24, 2023Updated 2 years ago
- Menagerie of video models trained on various video datasets☆10Oct 13, 2024Updated last year
- This code is provided for reproducibility of results in the paper: Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve A…☆23Feb 6, 2025Updated last year
- [AAAI 2025] Official Repository of 'SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living'☆23Sep 17, 2025Updated 7 months ago
- ☆14Nov 28, 2022Updated 3 years ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- dist☆10Dec 14, 2018Updated 7 years ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆17Feb 3, 2025Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆97May 21, 2023Updated 2 years ago
- [CVPR 2022] Sequential Voting with Relational Box Fields for Active Object Detection☆10Jun 19, 2022Updated 3 years ago
- [CVPR 2026] Official Repository of 'MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos'☆44Jan 23, 2026Updated 3 months ago
- [ACM MM 2024] Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer☆21Apr 28, 2026Updated last week
- ☆13Sep 23, 2021Updated 4 years ago