Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges
☆49Feb 10, 2026Updated 4 months ago
Alternatives and similar repositories for Building-Egocentric-Procedural-AI-Assistant
Users that are interested in Building-Egocentric-Procedural-AI-Assistant are comparing it to the libraries listed below.
- ☆12Jul 22, 2025Updated 10 months ago
- The theme of USTB report.☆10Dec 18, 2018Updated 7 years ago
- ☆28Jun 12, 2025Updated last year
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022☆15Apr 17, 2023Updated 3 years ago
- ☆11Jul 14, 2023Updated 2 years ago
- Code for the paper "Attention Meets Post-hoc Interpretability: A Mathematical Perspective", ICML 2024☆22Nov 10, 2025Updated 7 months ago
- Visual Relationship Reasoning for Grasp Planning☆19May 22, 2025Updated last year
- [CVPR 2026 Oral] "MARCO: Navigating the Unseen Space of Semantic Correspondence"☆135Apr 21, 2026Updated last month
- Code and data for UniEgoMotion (ICCV 2025)☆57Apr 18, 2026Updated last month
- [ICME2024, Official Code] for paper "Bringing Textual Prompt to AI-Generated Image Quality Assessment"☆21Jul 9, 2024Updated last year
- Official Repository for "Communication Efficient Federated Learning with Generalized Heavy-Ball Momentum", accepted at TMLR 2025☆28Jul 14, 2025Updated 11 months ago
- Code for the paper "Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric V…☆23Jan 9, 2025Updated last year
- Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024☆33Nov 25, 2025Updated 6 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 3 years ago
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Jun 13, 2024Updated 2 years ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆27Feb 16, 2025Updated last year
- Chain-of-Frames [CVPR 2026]☆40Jul 2, 2025Updated 11 months ago
- ☆19May 26, 2026Updated 2 weeks ago
- ☆19Sep 5, 2024Updated last year
- ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment☆20Jul 21, 2025Updated 10 months ago
- Official code release for "The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation"☆32Aug 19, 2025Updated 9 months ago
- Processing monocular face videos☆17Apr 10, 2026Updated 2 months ago
- A tool to estimate SMPL-X parameters (with T-pose) from FLAME meshes (or FLAME parameters)☆22Nov 21, 2024Updated last year
- ☆24Apr 7, 2026Updated 2 months ago
- Evaluation code for "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation"☆18Mar 10, 2024Updated 2 years ago
- ☆22Jul 1, 2024Updated last year
- Calculate sparse optical flow using pyramid Lucas-Kanade and sum of absolute differences (SAD) matching☆13Aug 8, 2017Updated 8 years ago
- [Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective☆612Jun 8, 2026Updated last week
- End-to-End AI Voice Assistant pipeline with Whisper for Speech-to-Text, Hugging Face LLM for response generation, and Edge-TTS for Text-t…☆27Apr 15, 2026Updated 2 months ago
- Code for our CVPR 2023 paper "MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition".☆52Mar 7, 2024Updated 2 years ago
- [ICLR 2025] OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework☆59Mar 18, 2026Updated 2 months ago
- The official pytorch code for Expressive 3D Facial Animation Generation Based on Local-to-global Latent Diffusion☆35Oct 25, 2024Updated last year
- ☆31Jun 30, 2025Updated 11 months ago
- ☆27Jul 10, 2025Updated 11 months ago
- [BMVC 2023] 3D Structure-guided Network for Tooth Alignment in 2D Photograph☆31Mar 12, 2025Updated last year
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆41Jun 12, 2025Updated last year
- easy and simple way to train, export and deploy pointpillars for 3D detection☆63Aug 12, 2021Updated 4 years ago
- ☆80Mar 7, 2024Updated 2 years ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆38Jul 23, 2025Updated 10 months ago