AutoLab-SAI-SJTU / Paper2RebuttalLinks
Official implementation of "Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance"
☆374Updated 2 weeks ago
Alternatives and similar repositories for Paper2Rebuttal
Users that are interested in Paper2Rebuttal are comparing it to the libraries listed below
Sorting:
- Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"☆282Updated last month
- ☆169Updated 3 weeks ago
- Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?☆216Updated last month
- Cambrian-S: Towards Spatial Supersensing in Video☆488Updated last month
- A paper list for spatial reasoning☆638Updated 3 weeks ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆103Updated 7 months ago
- A list of works on video generation towards world model☆337Updated this week
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆49Updated last month
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆203Updated 9 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆60Updated last month
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆64Updated 6 months ago
- A collection of vision foundation models unifying understanding and generation.☆59Updated last year
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆197Updated 2 months ago
- [NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration☆113Updated 2 months ago
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆198Updated 8 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆431Updated this week
- Visual Spatial Tuning☆172Updated last week
- MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence☆54Updated last month
- [NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding☆145Updated 2 months ago
- Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆109Updated last month
- Thinking in 360°: Humanoid Visual Search in the Wild☆114Updated 2 weeks ago
- ☆36Updated 10 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆157Updated last month
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆60Updated 7 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.☆412Updated this week
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆78Updated 2 months ago
- ☆124Updated 3 months ago
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆44Updated last year
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆77Updated last month
- LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling☆187Updated 2 weeks ago