z1oong/Building-Egocentric-Procedural-AI-Assistant

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/z1oong/Building-Egocentric-Procedural-AI-Assistant)

z1oong / Building-Egocentric-Procedural-AI-Assistant

Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges

☆53

Alternatives and similar repositories for Building-Egocentric-Procedural-AI-Assistant

Users that are interested in Building-Egocentric-Procedural-AI-Assistant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JiaweiLian / SRA
View on GitHub
NeurIPS 2025
☆19Oct 20, 2025Updated 9 months ago
JLChen-C / OccProphet
View on GitHub
[ICLR 2025] OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework
☆60Mar 18, 2026Updated 4 months ago
AlexZou14 / CVHSSR
View on GitHub
☆18Apr 5, 2024Updated 2 years ago
TdP-2025 / TdP-2025
View on GitHub
☆12Jul 22, 2025Updated last year
OpenGVLab / vinci
View on GitHub
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
☆93Nov 27, 2025Updated 8 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
yellow-binary-tree / ProactiveVideoQA
View on GitHub
ProactiveBench: A Comprehensive Benchmark for VideoLLM Proactive Interaction Evaluation
☆20Jan 8, 2026Updated 6 months ago
lab-sun / S2G2
View on GitHub
[RAL 2022] S2G2: Semi-Supervised Semantic Bird-Eye-View Grid-Map Generation Using a Monocular Camera for Autonomous Driving
☆11Nov 23, 2022Updated 3 years ago
fzi-forschungszentrum-informatik / anovox
View on GitHub
Multimodaler Anomalie-Detektions Benchmark für simulierte Szenarien
☆15Jul 16, 2024Updated 2 years ago
mehditeimouri-UT / Fragments-Expert
View on GitHub
Fragments-Expert is a software package for feature extraction from file fragments and classification among various file formats.
☆13Jan 16, 2024Updated 2 years ago
LIUTIGHE / BSCV-Dataset
View on GitHub
[NeurIPS'23] The official implementation of paper "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method"
☆44Jul 25, 2025Updated last year
YYX-future / LA-CMFER
View on GitHub
About [MM2024] Learning with Alignments: Tackling the Inter- and Intra-domain Shifts for Cross-multidomain Facial Expression Recognition
☆16Nov 12, 2024Updated last year
Chiaraplizz / ARGO1M-What-can-a-cook
View on GitHub
☆11Jul 14, 2023Updated 3 years ago
lab-sun / C2L-PR
View on GitHub
[TIV 2025] C2L-PR: Cross-modal Camera-to-LiDAR Place Recognition via Modality Alignment and Orientation Voting.
☆20Mar 28, 2026Updated 4 months ago
MINT-SJTU / STI-Bench
View on GitHub
STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
☆39Jan 12, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
As-Time-Goes-By / OmniSegNet
View on GitHub
☆19Apr 11, 2026Updated 3 months ago
robert80203 / EgoPER_official
View on GitHub
The official implementation of Error Detection in Egocentric Procedural Task Videos
☆33Sep 20, 2025Updated 10 months ago
zhousheng97 / EgoTextVQA
View on GitHub
[CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
☆52Jun 19, 2025Updated last year
WangyiNTU / nanobot-study
View on GitHub
Master AI Agent Assistant in 3 Days. A guided study plan using nanobot (~3k lines of Python) and your own AI Socratic Tutor. Learn Archit…
☆15Feb 7, 2026Updated 5 months ago
JiaweiLian / PADetBench
View on GitHub
☆13Jun 13, 2025Updated last year
Chiaraplizz / OSNOM
View on GitHub
Official repository from the paper "Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind"
☆17Mar 18, 2025Updated last year
PRIS-CV / AutoDriveRL
View on GitHub
☆19Jun 13, 2025Updated last year
apple / ml-streambridge
View on GitHub
☆40Nov 5, 2025Updated 8 months ago
cydiachen / MSFSR
View on GitHub
MSFSR：A Multi-Stage Face Super-Resolution with Accurate Facial Representation via Enhanced Facial Boundaries
☆12Jun 15, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZHUANGHP / FDG
View on GitHub
Fully Decoupled Neural Network Learning Using Delayed Gradients (FDG)
☆21Jul 5, 2021Updated 5 years ago
Coobiw / IP-IQA
View on GitHub
[ICME2024, Official Code] for paper "Bringing Textual Prompt to AI-Generated Image Quality Assessment"
☆21Jul 9, 2024Updated 2 years ago
facebookresearch / egagent
View on GitHub
Code for "Agentic Very Long Video Understanding" (EGAgent) [ACL 2026 Main]
☆50Jul 1, 2026Updated 3 weeks ago
paolotron / D3G
View on GitHub
Visual Relationship Reasoning for Grasp Planning
☆19May 22, 2025Updated last year
bytedance / OHTA
View on GitHub
[CVPR2024] OHTA: One-shot Hand Avatar via Data-driven Implicit Priors
☆33Jun 14, 2024Updated 2 years ago
AndongDeng / BEAR
View on GitHub
BEAR: a new BEnchmark on video Action Recognition
☆46Apr 21, 2024Updated 2 years ago
chaitanya100100 / UniEgoMotion
View on GitHub
Code and data for UniEgoMotion (ICCV 2025)
☆63Apr 18, 2026Updated 3 months ago
dvolgyes / FSITM
View on GitHub
Feature SIMilarity Index for Tone Mapping - A perceptual image quality assessment tool
☆10Mar 6, 2018Updated 8 years ago
Donvink / Qwen2.5-VL-Finetune
View on GitHub
Fine-tune Qwen2.5-VL-7B on custom visual QA tasks using LoRA + Accelerate, supporting single/multi-GPU training on COCO 2014 dataset.
☆30Apr 28, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
nightldj / dehaze_release
View on GitHub
PyTorch code for BMVC 2018 ``Strong Baseline for Single Image Dehazing with Deep Features and Instance Normalization''
☆20Nov 5, 2018Updated 7 years ago
JulesSanchez / recoverKITTI360label
View on GitHub
Python partial re-implementation of accumuLaser in python from the KITTI360 devkits to recover label of individual pointclouds from aggre…
☆32Jan 29, 2022Updated 4 years ago
boyuzz / WMCNN-Pytorch
View on GitHub
The Pytorch reproduction of WMCNN [Aerial Image Super Resolution via Wavelet Multiscale Convolutional Neural Networks]
☆18Aug 19, 2020Updated 5 years ago
ZhouYiiFeng / CDSR
View on GitHub
Joint Learning Content and Degradation Aware Embedding for Blind Super-Resolution
☆14Oct 20, 2022Updated 3 years ago
RayYoh / Hammer
View on GitHub
[CVPR 2026] Implementation of HAMMER: Harnessing MLLMs via Cross-Modal Integration for Intention-Driven 3D Affordance Grounding
☆20Apr 30, 2026Updated 2 months ago
facebookresearch / EgoT2
View on GitHub
Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)
☆34Jun 12, 2023Updated 3 years ago
ZhenrongWang / HOI-TG
View on GitHub
[CVPR 2025 Highlight] This repo is official PyTorch implementation of End-to-End HOI Reconstruction Transformer with Graph-based Encoding…
☆25May 8, 2025Updated last year