JeffreyXiang / MSRA-Intern-s-Toolkit
☆16 Updated 4 months ago
Alternatives and similar repositories for MSRA-Intern-s-Toolkit:
Users who are interested in MSRA-Intern-s-Toolkit are comparing it to the libraries listed below.
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021) ☆46 Updated 2 years ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆81 Updated this week
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆90 Updated 4 months ago
- A collection of vision foundation models unifying understanding and generation. ☆42 Updated 2 months ago
- A Video Tokenizer Evaluation Dataset ☆104 Updated 2 months ago
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective ☆41 Updated last month
- Unify and Simplify Discrete-time and Continuous-time Discrete Denoising Diffusion ☆18 Updated last month
- Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors [ICLR 2025] ☆108 Updated 3 weeks ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth? ☆107 Updated last month
- Code for TFG: Unified Training-Free Guidance for Diffusion Models ☆48 Updated last month
- Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024 ☆83 Updated 9 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation ☆118 Updated last month
- 【Nature Computational Science 2025🔥】Deep peak property learning for efficient chiral molecules ECD spectra prediction ☆39 Updated 2 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆84 Updated 5 months ago
- Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning" ☆82 Updated 11 months ago
- ICLR2024 statistics ☆47 Updated last year
- (ICLR2024) This is the official PyTorch implementation of ICLR2024 paper: Text-to-3D with Classifier Score Distillation ☆130 Updated last year
- ☆117 Updated 2 months ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models ☆68 Updated 9 months ago
- [CVPR 2025] Open implementation of "RandAR" ☆58 Updated 2 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) ☆65 Updated 2 weeks ago
- 🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation" ☆255 Updated 9 months ago
- List of papers on 4D Generation. ☆244 Updated 5 months ago
- GaussianDreamer extension of threestudio. ☆48 Updated 11 months ago
- Repo of HawkLlama. ☆14 Updated 2 months ago