facebookresearch / projectaria_eyetrackingLinks
Project Aria Social Eye Tracking Model
☆41Updated 7 months ago
Alternatives and similar repositories for projectaria_eyetracking
Users that are interested in projectaria_eyetracking are comparing it to the libraries listed below
Sorting:
- ☆127Updated 10 months ago
- Aria Training and Evaluation Kit☆27Updated 4 months ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆67Updated 6 months ago
- Program synthesis for 3D spatial reasoning☆42Updated last month
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆76Updated last year
- ☆39Updated 6 months ago
- Official implementation of "Self-Correcting Self-Consuming Loops for Generative Model Training" (ICML 2024)☆33Updated 11 months ago
- SATO: Stable Text-to-Motion Framework☆113Updated 5 months ago
- Dynadiff: Single-stage Decoding of Images from Continuously Evolving fMRI☆38Updated 2 months ago
- Diffusion Models as Data Mining Tools☆54Updated 2 months ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe…☆40Updated last year
- Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective".☆26Updated last year
- Visualisation of VISOR Segmentations with Annotations and Relations☆21Updated 2 years ago
- ☆78Updated 9 months ago
- Graph learning framework for long-term video understanding☆65Updated this week
- Clarity: A Minimalist Website Template for AI Research☆127Updated 6 months ago
- [NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos☆40Updated last month
- (CVPR 2023) Seeing a Rose in Five Thousand Ways☆118Updated 2 years ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆41Updated 3 months ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Updated last year
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"☆169Updated 4 months ago
- This repository contains a command-line interface(CLI) that can detect and blur out faces and license plates(PII) from images and videos…☆135Updated 9 months ago
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆29Updated 2 weeks ago
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆90Updated last year
- Induce brain-like topographic structure in your neural networks☆62Updated last month
- Multi-model video-to-text by combining embeddings from Flan-T5 + CLIP + Whisper + SceneGraph. The 'backbone LLM' is pre-trained from scra…☆52Updated 2 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Updated 2 years ago
- Aria data tools provide the open-source toolkit in C++ and Python to interact with data from Project Aria☆119Updated 2 weeks ago
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files