ligengen/EgoM2P

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ligengen/EgoM2P)

ligengen / EgoM2P

[ICCV 2025] The official implementation for EgoM2P: Egocentric Multimodal Multitask Pretraining.

☆38

Alternatives and similar repositories for EgoM2P

Users that are interested in EgoM2P are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Arking1995 / COHO
View on GitHub
[ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation
☆13Aug 13, 2024Updated last year
ShenhanQian / slurman
View on GitHub
☆13Mar 20, 2026Updated 4 months ago
michaelyuancb / egomono4d
View on GitHub
Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"
☆48Sep 23, 2025Updated 10 months ago
Biscue5 / EgoScaler
View on GitHub
[CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
☆48Dec 2, 2025Updated 7 months ago
xrkong / skimba
View on GitHub
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion
☆12Jan 14, 2026Updated 6 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ChanganVR / action2sound
View on GitHub
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
☆26Oct 1, 2024Updated last year
ethz-vlg / mvtracker
View on GitHub
[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking
☆512Nov 3, 2025Updated 8 months ago
Sangluisme / 4Deform
View on GitHub
4Deform: Neural Surface Deformation for Robust Shape Interpolation
☆26Dec 4, 2025Updated 7 months ago
EvolvingLMMs-Lab / EgoLife
View on GitHub
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
☆452Mar 19, 2025Updated last year
haruolabs / style-nerf2nerf
View on GitHub
Style-NeRF2NeRF implementation.
☆14Dec 26, 2024Updated last year
hd-epic / hd-epic-downloader
View on GitHub
HD-EPIC Python script to download the entire datasets or parts of it
☆24Oct 7, 2025Updated 9 months ago
jzr99 / Geo4D
View on GitHub
[ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
☆437Jun 6, 2025Updated last year
zgchen33 / LONG3R
View on GitHub
[ICCV2025] LONG3R: Long Sequence Streaming 3D Reconstruction
☆44Jul 25, 2025Updated last year
tum-vision / scenedino
View on GitHub
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion (ICCV 2025)
☆89Sep 18, 2025Updated 10 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
facebookresearch / egoman
View on GitHub
The repository provides code for EgoMAN model and dataset creation scripts.
☆32Dec 31, 2025Updated 6 months ago
Chiaraplizz / OSNOM
View on GitHub
Official repository from the paper "Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind"
☆17Mar 18, 2025Updated last year
ZechuanLi / GO-N3RDet
View on GitHub
[CVPR 2025] GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector
☆16Mar 19, 2025Updated last year
LaVi-Lab / EgoMask
View on GitHub
[ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"
☆27Jul 3, 2026Updated 3 weeks ago
byeongjun-park / SteerX
View on GitHub
[ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"
☆52Mar 20, 2025Updated last year
UMass-Embodied-AGI / ActionImages
View on GitHub
☆71Jun 9, 2026Updated last month
zhousheng97 / EgoTextVQA
View on GitHub
[CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
☆52Jun 19, 2025Updated last year
chaitanya100100 / UniEgoMotion
View on GitHub
Code and data for UniEgoMotion (ICCV 2025)
☆63Apr 18, 2026Updated 3 months ago
xiac20 / ScenePainter
View on GitHub
[ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment
☆37Oct 5, 2025Updated 9 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ou524u / Less3Depend
View on GitHub
[ICLR 2026] PyTorch implementation of "The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images with…
☆66May 13, 2026Updated 2 months ago
Xiaohao-Xu / Ambiguity-in-Space
View on GitHub
[ECCV 2026 Oral] One Scene, Two Depths: Probing Geometric Ambiguity in Monocular Foundation Models (Layered 3D Spatial Understanding)
☆23Jul 10, 2026Updated 2 weeks ago
dexwild / dexwild
View on GitHub
DexWild: Dexterous Human Interactions for In-the-Wild Robot Policies
☆45Aug 14, 2025Updated 11 months ago
mattdeitke / objaverse-xl-test-files
View on GitHub
☆12Sep 11, 2023Updated 2 years ago
chobao / Free360
View on GitHub
Code for "Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views", CVPR 2025
☆51Jul 7, 2025Updated last year
RAIVNLab / VideoNet
View on GitHub
CVPR '26 Highlight
☆24May 6, 2026Updated 2 months ago
algvr / maple
View on GitHub
MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…
☆34Dec 9, 2025Updated 7 months ago
facebookresearch / nymeria_dataset
View on GitHub
Official repo for Nymeria and NymeriaPlus datasets.
☆235Jul 22, 2026Updated last week
zju3dv / EgoAgent
View on GitHub
Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".
☆53Jun 30, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
HavenFeng / St4RTrack
View on GitHub
Official Implementation of paper "St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World"
☆143Sep 18, 2025Updated 10 months ago
ysbsb / awesome-quantization
View on GitHub
Awesome Quantization Paper lists with Codes
☆10Feb 24, 2021Updated 5 years ago
ruili3 / lari
View on GitHub
[ICML 2026] 🎨 Occluded 3D Scene Reconstruction from a Single Image.
☆94Jun 9, 2026Updated last month
continental / seed4d
View on GitHub
[WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"
☆24Sep 3, 2025Updated 10 months ago
Z1hanW / MonoFusion
View on GitHub
(ICCV 2025) MonoFusion
☆71Mar 22, 2026Updated 4 months ago
facebookresearch / DepthLM_Official
View on GitHub
[ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM
☆363Jun 1, 2026Updated last month
thechargedneutron / FIction
View on GitHub
Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'
☆21Mar 19, 2025Updated last year