EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset
☆60Nov 23, 2020Updated 5 years ago
Alternatives and similar repositories for EgoCom-Dataset
Users that are interested in EgoCom-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated 2 years ago
- ☆21Feb 15, 2022Updated 4 years ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)☆31Aug 3, 2022Updated 3 years ago
- ☆68Sep 13, 2022Updated 3 years ago
- This repository contains the source code of the CVPR 2020 paper: "Multimodal Future Localization and Emergence Prediction for Objects in …☆34Dec 8, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Inferring Body Pose in Egocentric Video via First and Second Person Interactions☆52Aug 31, 2021Updated 4 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- Example workflow for our data-centric speech benchmark☆18Jul 6, 2023Updated 2 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound"☆28Jun 6, 2022Updated 4 years ago
- ☆16Apr 10, 2019Updated 7 years ago
- Code for ACM MM 2023 paper - Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning☆14Jan 19, 2024Updated 2 years ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆27Jan 6, 2024Updated 2 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Python scripts to download Assembly101 from Google Drive☆75May 16, 2026Updated last month
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 5 years ago
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…☆15May 18, 2026Updated last month
- Code for https://arxiv.org/abs/1712.00254☆17Dec 6, 2017Updated 8 years ago
- New egocentric synthetic dataset for egocentric 3D human pose estimation☆70Jul 22, 2023Updated 2 years ago
- Multi-Target Embodied Question Answering☆26Jul 17, 2020Updated 5 years ago
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated 4 months ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆13Apr 11, 2025Updated last year
- The official implementation of paper: Estimating Egocentric 3D Human Pose in Global Space.☆12Sep 23, 2023Updated 2 years ago
- Tracking Multiple Deformable Objects in Egocentric Videos (CVPR 2023)☆13Apr 10, 2023Updated 3 years ago
- ☆13Jul 6, 2022Updated 3 years ago
- ☆81Jan 5, 2024Updated 2 years ago
- Low-Computation Egocentric Barcode Detector for the Blind☆10Jun 9, 2017Updated 9 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆17Nov 9, 2022Updated 3 years ago
- Notebooks demonstrating example applications of the cleanvision library☆17Dec 16, 2025Updated 6 months ago
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…☆57Apr 17, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression☆21May 14, 2024Updated 2 years ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆41Apr 11, 2025Updated last year
- 🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-p…☆138Apr 14, 2026Updated 2 months ago
- Repository containg experiments with Extreme Learning Machines And Reservoir Computing, ELMARC.☆20May 1, 2018Updated 8 years ago
- ☆12Apr 6, 2023Updated 3 years ago
- ☆11Aug 20, 2024Updated last year
- Implements the loss used in A. Furnari, S. Battiato, G. M. Farinella (2018). Leveraging Uncertainty to Rethink Loss Functions and Evaluat…☆11May 22, 2019Updated 7 years ago