fata1error404 / tsinghua-web-information-retrieval
EmojiNotion – project for CBMI 2025 conference
☆14 Updated 6 months ago
Alternatives and similar repositories for tsinghua-web-information-retrieval
Users who are interested in tsinghua-web-information-retrieval are comparing it to the repositories listed below
- Official repo and evaluation implementation of VSI-Bench ☆652 Updated 4 months ago
- CVPR 2024: Language Guided Generation of 3D Embodied AI Environments. ☆509 Updated 8 months ago
- A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vi… ☆780 Updated 4 months ago
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding" ☆272 Updated 8 months ago
- A paper list for spatial reasoning ☆495 Updated 2 weeks ago
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO ☆470 Updated 7 months ago
- Official implementation of Continuous 3D Perception Model with Persistent State ☆1,232 Updated 3 months ago
- Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset ☆399 Updated 6 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, … ☆195 Updated 7 months ago
- A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts… ☆1,458 Updated this week
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World ☆355 Updated last month
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models" ☆296 Updated last year
- 🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses ☆403 Updated 2 years ago
- Code for 3D-LLM: Injecting the 3D World into Large Language Models ☆1,162 Updated last year
- Compose multimodal datasets 🎹 ☆522 Updated 4 months ago
- Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight] ☆942 Updated 2 months ago
- ☆104 Updated last month
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024). ☆236 Updated last year
- A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A… ☆874 Updated this week
- Code of π^3: Permutation-Equivariant Visual Geometry Learning ☆1,460 Updated last week
- Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024) ☆201 Updated last month
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion" ☆1,315 Updated 6 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model ☆607 Updated last year
- A Visualization Tool for GPU Occupancy on S Cluster. ☆13 Updated 3 years ago
- A Shortcut-aware Video-QA Benchmark for Physical Understanding via Minimal Video Pairs ☆35 Updated 2 months ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding ☆373 Updated last week
- Official code for the CVPR 2025 paper "Navigation World Models". ☆475 Updated 3 weeks ago
- A curated list of papers and open-source resources focused on Physics-Inspired 3D Reconstruction and Simulation, intended to keep pace wi… ☆47 Updated 7 months ago
- Virtual Community: An Open World for Humans, Robots, and Society ☆177 Updated 3 weeks ago
- ☆31 Updated 6 months ago