Kai-46 / minFMLinks
☆162Updated last month
Alternatives and similar repositories for minFM
Users that are interested in minFM are comparing it to the libraries listed below
Sorting:
- ☆115Updated 2 months ago
- Code release for paper "Test-Time Training Done Right"☆295Updated last month
- [CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).☆129Updated last week
- Implementation of "Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision"☆165Updated last year
- Code for PointInfinity: Resolution-Invariant Point Diffusion Models☆34Updated last year
- ☆143Updated 9 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆132Updated 8 months ago
- ☆54Updated 2 months ago
- Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …☆79Updated this week
- Official Implementation of Rethinking Score Distillation as a Bridge Between Image Distributions☆83Updated 6 months ago
- Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction Models with Synthesized Data☆153Updated last year
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆145Updated 2 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆151Updated 8 months ago
- [CVPR 2024 Highlight] ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models☆173Updated last year
- Official Implementation of Posterior Distillation Sampling☆91Updated 3 months ago
- Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors [ICLR 2025]☆135Updated 5 months ago
- Implementation of LVSM, SOTA Large View Synthesis with Minimal 3d Inductive Bias, from Adobe Research☆102Updated 7 months ago
- PixNerd: Pixel Neural Field Diffusion☆121Updated 3 weeks ago
- Geometry-aware Novel View Synthesis with Pre-trained 2D Prior☆39Updated 2 years ago
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆145Updated 5 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆136Updated 2 months ago
- Generative Omnimatte (CVPR 2025)☆139Updated 4 months ago
- Evaluating Multiview Object Correspondence between Humans and Image models☆20Updated 8 months ago
- Unofficial implementation of 2D ProlificDreamer☆144Updated 9 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆122Updated 4 months ago
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models☆110Updated 10 months ago
- Code for SPAD : Spatially Aware Multiview Diffusers, CVPR 2024☆173Updated 8 months ago
- Official code repository of "HyperDiffusion: Generating Implicit Neural Fields with Weight-Space Diffusion" @ ICCV 2023☆196Updated last year
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆44Updated 4 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆102Updated 6 months ago