Hans-Lan / DASC7606-A2Links
☆15Updated last year
Alternatives and similar repositories for DASC7606-A2
Users that are interested in DASC7606-A2 are comparing it to the libraries listed below
Sorting:
- ☆25Updated last year
- ArxivFlow - Periodic Track on arXiv Paper☆49Updated 3 months ago
- Virtual Community: An Open World for Humans, Robots, and Society☆177Updated 3 weeks ago
- [ICCV 2025] CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games☆25Updated 3 weeks ago
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆66Updated 6 months ago
- ICLR 2025 Agent-Related Papers☆74Updated last year
- ☆42Updated 7 months ago
- An AI agent that automates the creation of CVPR/NeurIPS standard academic diagrams. Implements a strict "Logic (Architect) -> Vision (Ren…☆175Updated this week
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆195Updated 7 months ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆82Updated 4 months ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking☆82Updated this week
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆136Updated 2 months ago
- Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.☆292Updated last month
- CycleResearcher: Improving Automated Research via Automated Review☆309Updated 5 months ago
- Official Repository for PosterGen☆199Updated 2 weeks ago
- Collection of recent methods on 3D Scene Generation from Text Description.☆16Updated 9 months ago
- Using message app/bot to notify you when doing time-consuming tasks. Bake your experiments!☆83Updated last month
- Thinking in 360°: Humanoid Visual Search in the Wild☆78Updated last week
- Offcial Code of EyeReal☆83Updated last week
- ☆21Updated 3 years ago
- Collection of Highlight papers☆43Updated last year
- A python script for downloading huggingface datasets and models.☆20Updated 8 months ago
- STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?☆33Updated 5 months ago
- ☆104Updated last month
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆99Updated 5 months ago
- Ever wondered how popular your GitHub repo is compared to others?☆16Updated 4 months ago
- [CVPR2024] This is the official implement of MP5☆106Updated last year
- [ACL 2025 Main] Multi-Agent System for Science of Science☆116Updated 4 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆216Updated last month
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆219Updated this week