JeffreyXiang / MSRA-Intern-s-ToolkitLinks
☆17Updated 3 months ago
Alternatives and similar repositories for MSRA-Intern-s-Toolkit
Users that are interested in MSRA-Intern-s-Toolkit are comparing it to the libraries listed below
Sorting:
- Code release for paper "Test-Time Training Done Right"☆321Updated last week
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective☆45Updated 10 months ago
- ICLR2024 statistics☆48Updated last year
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry☆37Updated last month
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆185Updated last month
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆53Updated last month
- ☆36Updated 5 months ago
- ☆55Updated 3 months ago
- [NeurIPS 2025] Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".☆92Updated 3 weeks ago
- Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, …☆29Updated 9 months ago
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆222Updated 7 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆271Updated last month
- ☆150Updated 10 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆87Updated last year
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated last year
- Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024☆91Updated last year
- A collection of vision foundation models unifying understanding and generation.☆59Updated 10 months ago
- ☆55Updated last month
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆134Updated last year
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆53Updated last week
- ☆16Updated last year
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆78Updated 5 months ago
- ☆283Updated last month
- A Video Tokenizer Evaluation Dataset☆139Updated 10 months ago
- Unify and Simplify Discrete-time and Continuous-time Discrete Denoising Diffusion☆23Updated 10 months ago
- Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆170Updated 2 weeks ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆131Updated 10 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆164Updated last week
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆72Updated 6 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 3 years ago