srijandas07 / clip_baseline_LTA_Ego4d
Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)
☆15Jul 4, 2022Updated 3 years ago
Alternatives and similar repositories for clip_baseline_LTA_Ego4d
Users who are interested in clip_baseline_LTA_Ego4d are comparing it to the repositories listed below
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers☆21Aug 2, 2024Updated last year
- Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"☆17Oct 6, 2025Updated 4 months ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Apr 20, 2023Updated 2 years ago
- This is the official repository of LLAVIDAL☆23Oct 4, 2025Updated 4 months ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 2 years ago
- Team Doggeee's solution to the Ego4D LTA challenge @ CVPRW'23☆13Nov 4, 2023Updated 2 years ago
- ☆18Dec 17, 2022Updated 3 years ago
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆71Jan 29, 2024Updated 2 years ago
- [WIP] Code for LangToMo☆20Jun 25, 2025Updated 7 months ago
- Simple PyTorch Dataset for the EPIC-Kitchens-55 and EPIC-Kitchens-100 that handles frames and features (rgb, optical flow, and objects) f…☆24Jan 22, 2023Updated 3 years ago
- ☆19Sep 10, 2021Updated 4 years ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆46Jul 26, 2024Updated last year
- Pose driven attention mechanism☆45Mar 31, 2022Updated 3 years ago
- PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations☆17Apr 25, 2020Updated 5 years ago
- An unofficial PyTorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆23Jan 9, 2025Updated last year
- WACV 2024: "PathLDM: Text conditioned Latent Diffusion Model for Histopathology"☆48Jul 7, 2024Updated last year
- Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Un…☆134Aug 23, 2023Updated 2 years ago
- Environments for Active Vision Reinforcement Learning☆28Oct 10, 2024Updated last year
- Official code implementation of the paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆28Sep 23, 2024Updated last year
- ☆24Mar 24, 2023Updated 2 years ago
- [WACV 2024] Code for "Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders"☆25Aug 16, 2024Updated last year
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆254May 9, 2024Updated last year
- [CVPR2022 Oral] VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation☆29Jul 19, 2022Updated 3 years ago
- ☆32Feb 10, 2023Updated 3 years ago
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors☆31Jun 2, 2024Updated last year
- Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset☆533Feb 4, 2026Updated 2 weeks ago
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆72Nov 1, 2024Updated last year
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆34Jun 17, 2024Updated last year
- [CVPR 2023 Highlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆33Aug 30, 2023Updated 2 years ago
- Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch☆112Jan 25, 2021Updated 5 years ago
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.☆32Aug 15, 2023Updated 2 years ago
- [Main Conference @ EACL'26] [Workshop @ NeurIPS'24] 🎞️ LVNet.☆42Feb 10, 2026Updated last week
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Nov 7, 2023Updated 2 years ago
- [CVPR 2022] Egocentric Action Target Prediction in 3D☆32Dec 2, 2025Updated 2 months ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 3 years ago
- ☆30Dec 16, 2018Updated 7 years ago
- Site selection for green spaces and wind corridors to mitigate the urban heat island effect in Seoul☆18Dec 29, 2019Updated 6 years ago
- [ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy☆227Mar 29, 2025Updated 10 months ago
- MetaC provides a read-eval-print loop (a REPL) and notebook interactive development environment (a NIDE) for C programming. MetaC also …☆12Feb 10, 2026Updated last week