JeffreyXiang / MSRA-Intern-s-Toolkit
☆17Updated 2 months ago
Alternatives and similar repositories for MSRA-Intern-s-Toolkit
Users that are interested in MSRA-Intern-s-Toolkit are comparing it to the libraries listed below
- ☆36Updated 4 months ago
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective☆45Updated 9 months ago
- [NeurIPS 2025] Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".☆91Updated last week
- Code release for paper "Test-Time Training Done Right"☆307Updated 2 months ago
- ☆262Updated 3 weeks ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆66Updated 5 months ago
- ☆53Updated 2 months ago
- ICLR2024 statistics☆48Updated last year
- ☆51Updated 5 months ago
- Code for MetaMorph: Multimodal Understanding and Generation via Instruction Tuning☆218Updated 6 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆173Updated 3 weeks ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆132Updated last year
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry☆35Updated last month
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆36Updated 11 months ago
- Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024☆89Updated last year
- A framework that allows you to apply Sparse AutoEncoder on any models☆41Updated 3 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆87Updated last year
- ☆54Updated last month
- A Video Tokenizer Evaluation Dataset☆136Updated 9 months ago
- A Collection of Papers on Diffusion Language Models☆137Updated last month
- ☆32Updated 6 months ago
- Generative Universal Verifier as Multimodal Meta-Reasoner☆31Updated 2 weeks ago
- Unify and Simplify Discrete-time and Continuous-time Discrete Denoising Diffusion☆23Updated 9 months ago
- ☆149Updated 10 months ago
- A collection of vision foundation models unifying understanding and generation.☆57Updated 10 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 3 years ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆261Updated 3 weeks ago
- Official Implementation of LaViDa: A Large Diffusion Language Model for Multimodal Understanding☆164Updated 2 weeks ago
- Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, …☆30Updated 8 months ago
- Official repository for ReasonGen-R1☆72Updated 4 months ago