reddyav1 / RoCoG-v2
RoCoG-v2 (Robot Control Gestures) is a dataset intended to support the study of synthetic-to-real and ground-to-air video domain adaptation.
β16Updated 7 months ago
Related projects β
Alternatives and complementary repositories for RoCoG-v2
- Python scripts to download Assembly101 from Google Driveβ32Updated last month
- π Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-pβ¦β80Updated last week
- β67Updated 10 months ago
- β77Updated 2 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasksβ59Updated last month
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'β31Updated last year
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]β19Updated last year
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]β90Updated 4 months ago
- Pytorch code for Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning, CVPR2022.β86Updated last year
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)β30Updated 2 months ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"β20Updated 6 months ago
- Simple PyTorch Dataset for the EPIC-Kitchens-55 and EPIC-Kitchens-100 that handles frames and features (rgb, optical flow, and objects) fβ¦β23Updated last year
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentationβ24Updated 3 weeks ago
- Future Transformer for Long-term Action Anticipation (CVPR 2022)β47Updated last year
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"β47Updated last year
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long β¦β61Updated 5 months ago
- Code and models for the Action Recognition benchmark of Assembly101β10Updated last year
- [NeurIPS2022] Egocentric Video-Language Pretrainingβ228Updated 6 months ago
- This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022.β39Updated last year
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"β114Updated 3 months ago
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understandingβ75Updated last year
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)β33Updated last year
- A curated list of egocentric (first-person) vision and related area resourcesβ256Updated 3 weeks ago
- [CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perceptionβ118Updated 2 years ago
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detectβ¦β14Updated 3 months ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)β38Updated 3 months ago
- A repo for processing the raw hand object detections to produce releasable pickles + library for using theseβ35Updated 2 weeks ago
- β22Updated last year
- Annotations for the public release of the EPIC-KITCHENS-100 datasetβ130Updated 2 years ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)β29Updated 2 weeks ago