OpenGVLab/vinci

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

☆93

Alternatives and similar repositories for vinci

Users who are interested in vinci are comparing it to the repositories listed below.

CG-Bench / CG-Bench
☆20 · Jan 26, 2025 · Updated last year
ayiyayi / EgoExoBench
☆15 · Nov 13, 2025 · Updated 8 months ago
Jazzcharles / Egoinstructor
PyTorch implementation for Egoinstructor at CVPR 2024
☆28 · Dec 1, 2024 · Updated last year
InternRobotics / EgoThinker
Official implementation of EgoThinker at NeurIPS 2025
☆29 · Nov 25, 2025 · Updated 7 months ago
InternRobotics / EgoHOD
Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024
☆35 · Nov 25, 2025 · Updated 7 months ago
OpenGVLab / EgoExoLearn
[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset
☆85 · Aug 26, 2025 · Updated 10 months ago
OpenGVLab / EgoVideo
[CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Challenges in CVPR 2024
☆136 · May 11, 2025 · Updated last year
WPR001 / Ego-ST
☆16 · Sep 25, 2025 · Updated 9 months ago
Lyman-Smoker / Awesome_Ego_Action
A curated list of Egocentric Action Understanding resources
☆59 · Nov 26, 2025 · Updated 7 months ago
Echo0125 / MAT-Memory-and-Anticipation-Transformer
[ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding
☆50 · Oct 7, 2023 · Updated 2 years ago
EvolvingLMMs-Lab / EgoLife
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
☆447 · Mar 19, 2025 · Updated last year
iSEE-Laboratory / EgoExo-Fitness
(ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"
☆38 · Apr 8, 2025 · Updated last year
zihuixue / AlignEgoExo
Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…
☆19 · Apr 5, 2024 · Updated 2 years ago
ayiyayi / Awesome-Egocentric-and-Exocentric-Vision
☆40 · Nov 14, 2025 · Updated 8 months ago
egolife-ai / Ego-R1
[TPAMI 2026] Ego-R1: Agentic Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning
☆165 · Jun 10, 2026 · Updated last month
Mark12Ding / Dispider
[CVPR 2025] Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
☆180 · Mar 23, 2025 · Updated last year
sapeirone / EgoPack
Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…
☆24 · Jun 13, 2024 · Updated 2 years ago
z1oong / Building-Egocentric-Procedural-AI-Assistant
Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges
☆52 · Feb 10, 2026 · Updated 5 months ago
EgocentricVision / EgocentricVision
🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-p…
☆139 · Apr 14, 2026 · Updated 3 months ago
Becomebright / GroundVQA
Official PyTorch code of GroundVQA (CVPR'24)
☆63 · Sep 13, 2024 · Updated last year
thechargedneutron / FIction
Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'
☆21 · Mar 19, 2025 · Updated last year
daeunni / StreamGaze
Code for "StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos"
☆26 · May 13, 2026 · Updated 2 months ago
mll-lab-nu / TStar
TStar is a unified temporal search framework for long-form video question answering
☆97 · Mar 23, 2026 · Updated 3 months ago
Sid2697 / HOI-Ref
Code implementation for the paper "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"
☆30 · Apr 16, 2024 · Updated 2 years ago
facebookresearch / EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
☆110 · Jul 2, 2024 · Updated 2 years ago
showlab / videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
☆674 · Nov 26, 2025 · Updated 7 months ago
JoeLeelyf / OVO-Bench
[CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
☆153 · Jul 24, 2025 · Updated 11 months ago
OpenGVLab / video-mamba-suite
The suite of modeling video with Mamba
☆295 · May 14, 2024 · Updated 2 years ago
haoningwu3639 / SpatialScore
[CVPR 2026 Highlight] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence
☆84 · May 28, 2026 · Updated last month
Jialuo-Li / DIG
[CVPR 2026] Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
☆21 · Feb 21, 2026 · Updated 4 months ago
Kami-code / HandsOnVLM-release
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
☆41 · Sep 15, 2025 · Updated 10 months ago
SooLab / EyeWO
[NeurIPS 2025] The official PyTorch implementation of "Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video".
☆34 · Dec 25, 2025 · Updated 6 months ago
yuecao0119 / MMInstruct
[SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…
☆64 · Nov 7, 2024 · Updated last year
KangsanKim07 / VideoICL
[CVPR 2025] VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding
☆23 · Mar 24, 2025 · Updated last year
zhengrongz / AoTD
[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".
☆58 · May 25, 2025 · Updated last year
THU-KEG / LongWriter-V
[ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆24 · Mar 29, 2025 · Updated last year
facebookresearch / VidOSC
Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)
☆37 · Sep 9, 2024 · Updated last year
hmxiong / StreamChat
Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" (ICLR 2025)
☆111 · Mar 14, 2025 · Updated last year
fansunqi / AKeyS
Agentic Keyframe Search for Video Question Answering
☆18 · Jun 30, 2026 · Updated 2 weeks ago