worldbench/VideoLucy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/worldbench/VideoLucy)

worldbench / VideoLucy

[NeurIPS 2025] Deep Memory Backtracking for Long Video Understanding

☆68

Alternatives and similar repositories for VideoLucy

Users that are interested in VideoLucy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Zplusdragon / ReID5o_ORBench
View on GitHub
[NeurIPS2025] ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
☆93Jan 8, 2026Updated 6 months ago
worldbench / awesome-3d-in-the-wild
View on GitHub
🌐 A Roadmap for 3D Scene Understanding in the Wild
☆33Dec 19, 2025Updated 7 months ago
worldbench / SPIRAL
View on GitHub
[NeurIPS 2025] SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation and Understanding
☆44Jul 8, 2026Updated last week
Zplusdragon / CION_ReIDZoo
View on GitHub
[NeurIPS2024] Cross-video Identity Correlating for Person Re-identification Pre-training
☆106Jun 20, 2025Updated last year
worldbench / Calib3D
View on GitHub
[WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding
☆73Dec 6, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
worldbench / 3EED
View on GitHub
[NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D
☆212Dec 26, 2025Updated 6 months ago
worldbench / LiDARCrafter
View on GitHub
[AAAI 2026 Oral] LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences
☆196Dec 12, 2025Updated 7 months ago
worldbench / Pi3DET
View on GitHub
[ICCV 2025] Perspective-Invariant 3D Object Detection
☆177Dec 22, 2025Updated 6 months ago
worldbench / WorldLens
View on GitHub
[CVPR 2026 Oral] WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
☆240Jan 18, 2026Updated 6 months ago
robosense2025 / track3
View on GitHub
Track 3: Sensor Placement
☆19Aug 22, 2025Updated 10 months ago
robosense2025 / track4
View on GitHub
Track 4: Cross-Modal Drone Navigation
☆17Aug 28, 2025Updated 10 months ago
facebookresearch / egagent
View on GitHub
Code for "Agentic Very Long Video Understanding" (EGAgent) [ACL 2026 Main]
☆49Jul 1, 2026Updated 2 weeks ago
worldbench / DriveBench
View on GitHub
[ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives
☆244Dec 12, 2025Updated 7 months ago
robosense2025 / track5
View on GitHub
Track 5: Cross-Platform 3D Object Detection
☆22Aug 16, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
robosense2025 / track2
View on GitHub
Track 2: Social Navigation
☆27Aug 19, 2025Updated 11 months ago
wgcyeo / WorldMM
View on GitHub
[CVPR 2026 Highlight] WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning
☆96Jun 18, 2026Updated last month
robosense2025 / track1
View on GitHub
Track 1: Driving with Language
☆26Aug 23, 2025Updated 10 months ago
Jialuo-Li / DIG
View on GitHub
[CVPR 2026] Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
☆21Feb 21, 2026Updated 4 months ago
worldbench / awesome-vla-for-ad
View on GitHub
🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future
☆444Jun 27, 2026Updated 3 weeks ago
EliSpectre / MM-Mem
View on GitHub
[ACL-26 (main)] From Verbatim to Gist Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video A…
☆39Apr 19, 2026Updated 3 months ago
egolife-ai / Ego-R1
View on GitHub
[TPAMI 2026] Ego-R1: Agentic Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning
☆165Jun 10, 2026Updated last month
xiaoqian-shen / Vgent
View on GitHub
[NeurIPS 2025 Spotlight] Official PyTorch implementation of Vgent
☆48Nov 30, 2025Updated 7 months ago
worldbench / SuperFlow
View on GitHub
[ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners
☆52Dec 4, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
worldbench / awesome-3d-4d-world-models
View on GitHub
[TPAMI 2026] 3D and 4D World Modeling: A Survey
☆949Updated this week
Zplusdragon / UFineBench
View on GitHub
[CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity
☆81Sep 28, 2024Updated last year
Becomebright / ReKV
View on GitHub
[ICLR'25] Streaming Video Question-Answering with In-context Video KV-Cache Retrieval
☆121Nov 4, 2025Updated 8 months ago
LaVi-Lab / Rethink_CoT_Video
View on GitHub
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21Dec 14, 2025Updated 7 months ago
synvo-ai / HippoCamp
View on GitHub
A benchmark for evaluating contextual agents on realistic multimodal personal-computer environments with profiling and factual-retention …
☆29Apr 2, 2026Updated 3 months ago
worldbench / awesome-spatial-intelligence
View on GitHub
🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems
☆150Jul 12, 2026Updated last week
worldbench / Robo3D
View on GitHub
[ICCV 2023] Robo3D: Towards Robust and Reliable 3D Perception against Corruptions
☆376Dec 6, 2025Updated 7 months ago
EvolvingLMMs-Lab / SimpleStream
View on GitHub
A simple video streaming baseline that outperforms SOTAs.
☆148May 1, 2026Updated 2 months ago
dywu98 / CBL-Conditional-Boundary-Loss
View on GitHub
The official implementation of IEEE-TIP paper “Conditional Boundary Loss for Semantic Segmentation”
☆20Nov 20, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
MILVLG / videoarm
View on GitHub
☆26Apr 9, 2026Updated 3 months ago
pufanyi / syphus
View on GitHub
Syphus: Automatic Instruction-Response Generation Pipeline
☆14Dec 14, 2023Updated 2 years ago
dongyh20 / Demo-ICL
View on GitHub
Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition
☆40Mar 3, 2026Updated 4 months ago
qiujihao19 / LongVideo-R1
View on GitHub
[CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding
☆50Jul 7, 2026Updated last week
nusnlp / d2vlm
View on GitHub
[ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models
☆24Apr 18, 2026Updated 3 months ago
CG-Bench / CG-Bench
View on GitHub
☆20Jan 26, 2025Updated last year
EvolvingLMMs-Lab / EgoLife
View on GitHub
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
☆447Mar 19, 2025Updated last year