[ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".
☆13 · Feb 24, 2025 · Updated last year
Alternatives and similar repositories for CSTS
Users that are interested in CSTS are comparing it to the libraries listed below.
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align… ☆19 · Apr 5, 2024 · Updated last year
- HD-EPIC Python script to download the entire dataset or parts of it ☆18 · Oct 7, 2025 · Updated 5 months ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for… ☆32 · Feb 22, 2025 · Updated last year
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset ☆81 · Aug 26, 2025 · Updated 6 months ago
- ☆57 · Apr 28, 2025 · Updated 10 months ago
- ☆20 · Feb 13, 2026 · Updated last month
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos ☆25 · Oct 1, 2024 · Updated last year
- Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral) ☆18 · Jun 23, 2024 · Updated last year
- ☆37 · May 28, 2025 · Updated 9 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight) ☆34 · Jun 12, 2023 · Updated 2 years ago
- ☆12 · Jun 11, 2025 · Updated 9 months ago
- Pytorch implementation for Egoinstructor at CVPR 2024 ☆28 · Dec 1, 2024 · Updated last year
- Sparse Neural Network Tools ☆12 · Jul 15, 2024 · Updated last year
- Pruning is all you need (hopefully) ☆12 · Sep 7, 2022 · Updated 3 years ago
- Official Implementation of CODE ☆17 · Sep 26, 2024 · Updated last year
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos' ☆13 · Jun 16, 2024 · Updated last year
- ☆13 · Nov 28, 2021 · Updated 4 years ago
- Human-centric environment representations from egocentric video ☆14 · Feb 5, 2026 · Updated last month
- Comparison of method "Pruning at initialization prior to training" (Synflow/SNIP/GraSP) in PyTorch ☆17 · May 12, 2024 · Updated last year
- ☆17 · Dec 4, 2024 · Updated last year
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models" ☆19 · Feb 1, 2026 · Updated last month
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024 ☆43 · Dec 7, 2024 · Updated last year
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 · Oct 8, 2023 · Updated 2 years ago
- ☆24 · Apr 29, 2024 · Updated last year
- Official PyTorch code of GroundVQA (CVPR'24) ☆64 · Sep 13, 2024 · Updated last year
- 🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-p… ☆125 · Nov 23, 2024 · Updated last year
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection ☆51 · May 1, 2023 · Updated 2 years ago
- Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI … ☆14 · Mar 1, 2025 · Updated last year
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch. ☆10 · Oct 27, 2023 · Updated 2 years ago
- This repository was created as part of Sebastian Raschka's workshop, Building LLMs Ground Up. ☆16 · Sep 14, 2024 · Updated last year
- Find Autism Friendly Places ☆136 · Aug 27, 2015 · Updated 10 years ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?" ☆34 · Feb 21, 2025 · Updated last year
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- ☆14 · Sep 7, 2023 · Updated 2 years ago
- Modification to YOLO for improving Dynamic Real-Time Processing on Robotics Operating Systems for Autonomous Vehicle System ☆21 · Feb 16, 2022 · Updated 4 years ago
- Inferring Body Pose in Egocentric Video via First and Second Person Interactions ☆50 · Aug 31, 2021 · Updated 4 years ago
- [OpenReview] Official PyTorch Implementation for "Towards Adversarial Robustness of Bayesian Neural Network through Hierarchical Variatio… ☆23 · Feb 15, 2022 · Updated 4 years ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval ☆41 · Apr 11, 2025 · Updated 11 months ago
- ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model ☆16 · Jan 31, 2024 · Updated 2 years ago