JeffreyXiang / MSRA-Intern-s-Toolkit
☆17 Updated 4 months ago
Alternatives and similar repositories for MSRA-Intern-s-Toolkit
Users who are interested in MSRA-Intern-s-Toolkit are comparing it to the repositories listed below
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry ☆39 Updated 3 months ago
- ☆191 Updated 3 weeks ago
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective ☆45 Updated 11 months ago
- Code release for paper "Test-Time Training Done Right" ☆350 Updated last month
- ☆57 Updated 4 months ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning ☆56 Updated 2 months ago
- ICLR2024 statistics ☆48 Updated 2 years ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give… ☆202 Updated 2 months ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning ☆136 Updated 2 months ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆143 Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆89 Updated last year
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning ☆231 Updated 8 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries ☆285 Updated 2 months ago
- [NeurIPS 2025] Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP". ☆94 Updated 2 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021) ☆46 Updated 3 years ago
- ☆55 Updated 7 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation ☆18 Updated 8 months ago
- ☆37 Updated 6 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation ☆130 Updated 11 months ago
- ☆60 Updated 2 weeks ago
- A Video Tokenizer Evaluation Dataset ☆147 Updated 11 months ago
- ☆159 Updated last year
- ☆51 Updated 4 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation ☆180 Updated last month
- ☆18 Updated last year
- Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, … ☆29 Updated 10 months ago
- A collection of vision foundation models unifying understanding and generation. ☆59 Updated last year
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning ☆233 Updated 7 months ago
- Generative Universal Verifier as Multimodal Meta-Reasoner ☆43 Updated last month
- Official repo for EscapeCraft (a 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025. ☆36 Updated 6 months ago