dannyXSC / Fudan_FreshmanTest
Fudan University graduate student orientation test
☆14Updated last year
Alternatives and similar repositories for Fudan_FreshmanTest
Users who are interested in Fudan_FreshmanTest are comparing it to the repositories listed below
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆227Updated last week
- [ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs☆54Updated 3 months ago
- A Python script for downloading Hugging Face datasets and models.☆19Updated last month
- RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆48Updated last week
- Accepted by CVPR 2024☆33Updated last year
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆159Updated last month
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆122Updated last week
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆157Updated 2 weeks ago
- [LLaVA-Video-R1]✨First Adaptation of R1 to LLaVA-Video (2025-03-18)☆28Updated 3 weeks ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆39Updated last month
- Collections of Papers and Projects for Multimodal Reasoning.☆105Updated last month
- Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models☆25Updated this week
- AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation☆75Updated 2 months ago
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆26Updated this week
- OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation☆175Updated this week
- Official code for MotionBench (CVPR 2025)☆40Updated 3 months ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆142Updated 3 months ago
- R1-like Video-LLM for Temporal Grounding☆93Updated last week
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆230Updated last month
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆124Updated 5 months ago
- ☆89Updated last month
- A paper list for spatial reasoning☆82Updated this week
- [CVPR2024] This is the official implementation of MP5☆102Updated 11 months ago
- Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆192Updated last week
- Harbin Institute of Technology, spring 2023 semester: Compiler Systems course labs, exercises, slides, and final exam review materials☆11Updated last year
- [Arxiv 2025: MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation]☆35Updated 2 months ago
- Latent Motion Token as the Bridging Language for Robot Manipulation☆89Updated 3 weeks ago
- ☆119Updated 3 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆74Updated 7 months ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆187Updated 11 months ago