[ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?
☆221Dec 15, 2025Updated 2 months ago
Alternatives and similar repositories for iREPA
Users that are interested in iREPA are comparing it to the libraries listed below
Sorting:
- Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.☆74Oct 26, 2025Updated 4 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆164Jan 7, 2026Updated last month
- PyTorch implementation of NEPA☆318Feb 9, 2026Updated 3 weeks ago
- Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"☆137Oct 17, 2025Updated 4 months ago
- The official PyTorch implementation of RefRef: A Synthetic Dataset and Benchmark for Reconstructing Refractive and Reflective Objects☆15Updated this week
- The code contains updated files to sucessfully compile Minkowski engine to latest cuda and pytorch versions☆23Oct 22, 2025Updated 4 months ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 6 months ago
- [ICLR 2026] Self-Representation Alignment for Diffusion Transformers (SRA)☆111Feb 22, 2026Updated last week
- [CVPR2026] Official repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”☆176Updated this week
- A Cross-Platform Backend for High-Performance Sparse Convolutions☆108Feb 2, 2026Updated last month
- [Siggraph Asia 25] SS4D: Native 4D Generative Model via Structured Spacetime Latents☆32Dec 17, 2025Updated 2 months ago
- FreeCond: A Free Lunch for Input Conditions in Text-Guided Inpainting. FreeCond introduces a more generalized form💪 of the original inpa…☆15May 22, 2025Updated 9 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Nov 25, 2024Updated last year
- Official PyTorch Code for our ICCV25 paper- Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution☆85Aug 6, 2025Updated 6 months ago
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,553Mar 16, 2025Updated 11 months ago
- This is the official PyTorch implementation of TBSR. Our team received 2nd place (real data track) and 3rd place (synthetic track) in NTI…☆14Jun 11, 2022Updated 3 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆92Feb 19, 2026Updated last week
- StableWorld: Towards Stable and Consistent Long Interactive Video Generation☆81Feb 3, 2026Updated 3 weeks ago
- ☆15Mar 30, 2025Updated 11 months ago
- Implementation of DGMA2-Net: A Difference-Guided Mutiscale Aggregation Attention Network for Remote Sensing Image Change Detection☆17Sep 17, 2024Updated last year
- ☆37Dec 25, 2025Updated 2 months ago
- ☆11Oct 6, 2022Updated 3 years ago
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆454Dec 6, 2025Updated 2 months ago
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆53Oct 12, 2025Updated 4 months ago
- [ICCV 2025] Official Implementation of Contrastive Flow Matching☆161Jun 25, 2025Updated 8 months ago
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆69Nov 17, 2023Updated 2 years ago
- ☆48Feb 9, 2026Updated 3 weeks ago
- Cuda mesh utils.☆173Feb 10, 2026Updated 2 weeks ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated 11 months ago
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆115Nov 3, 2025Updated 3 months ago
- [CVPR 2026] DDT: Decoupled Diffusion Transformer☆363Aug 22, 2025Updated 6 months ago
- Are Video Models Ready as Zero-shot Reasoners?☆84Nov 24, 2025Updated 3 months ago
- [Arxiv'25] DINO-Tok: Adapting DINO for Visual Tokenizers☆35Nov 25, 2025Updated 3 months ago
- Synthetic Alphabet Dataset☆19Mar 27, 2025Updated 11 months ago
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 4 months ago
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆201Apr 29, 2025Updated 10 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆183Mar 20, 2025Updated 11 months ago
- This is the official PyTorch implement of MW-ISPNet. Our team received a winner award in the AIM 2020 Learned Image ISP Challenge (ECCVW …☆41Jun 11, 2022Updated 3 years ago