EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset
☆60Nov 23, 2020Updated 5 years ago
Alternatives and similar repositories for EgoCom-Dataset
Users that are interested in EgoCom-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- ☆21Feb 15, 2022Updated 4 years ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)☆31Aug 3, 2022Updated 3 years ago
- Code implementation for our ECCV, 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos"☆34Feb 5, 2024Updated 2 years ago
- This repository contains the source code of the CVPR 2020 paper: "Multimodal Future Localization and Emergence Prediction for Objects in …☆34Dec 8, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Inferring Body Pose in Egocentric Video via First and Second Person Interactions☆50Aug 31, 2021Updated 4 years ago
- Example workflow for our data-centric speech benchmark☆17Jul 6, 2023Updated 2 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound"☆27Jun 6, 2022Updated 3 years ago
- Integrating Human Gaze into Attention for Egocentric Activity Recognition (WACV 2021)☆25Jul 20, 2023Updated 2 years ago
- ☆16Apr 10, 2019Updated 6 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- The official repository for the CVPR 2019 paper "Overcoming Limitations of Mixture Density Networks: A Sampling and Fitting Framework for…☆49Jan 7, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 4 years ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 2 years ago
- Multi-Target Embodied Question Answering☆26Jul 17, 2020Updated 5 years ago
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated last month
- [ICLR 2019] Learning Factorized Multimodal Representations☆68Aug 4, 2020Updated 5 years ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated 11 months ago
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…☆42Mar 19, 2026Updated last week
- A modified version of vid2vid for Speech2Video, Text2Video Paper☆36Jun 4, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for Overinterpretation paper☆19Jul 6, 2023Updated 2 years ago
- ☆13Jul 6, 2022Updated 3 years ago
- Notebooks demonstrating example applications of the cleanvision library☆17Dec 16, 2025Updated 3 months ago
- Wave-U-Net for automatic (drum) mixing☆38Mar 24, 2023Updated 3 years ago
- [AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression☆20May 14, 2024Updated last year
- 🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-p…☆125Nov 23, 2024Updated last year
- CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition☆12Apr 21, 2020Updated 5 years ago
- ☆12Apr 6, 2023Updated 2 years ago
- Implements the loss used in A. Furnari, S. Battiato, G. M. Farinella (2018). Leveraging Uncertainty to Rethink Loss Functions and Evaluat…☆11May 22, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10Aug 20, 2024Updated last year
- Deep learning for pedestrians: backpropagation in CNNs. Latex and PyTorch code to verify theoretical derivations.☆12Jun 21, 2022Updated 3 years ago
- A PyTorch implementation of Conv-TasNet☆46Nov 25, 2019Updated 6 years ago
- DCASE2020 Challenge Task 2 baseline variants☆21Apr 2, 2020Updated 5 years ago
- Collection of gym environments with support for domain randomization☆10Dec 11, 2024Updated last year
- ☆11Sep 29, 2020Updated 5 years ago
- This repo covers the implementation for Labelling unlabelled videos from scratch with multi-modal self-supervision, which learns clusters…☆117Apr 26, 2021Updated 4 years ago