PyTorch implementation of NEPA
☆327Feb 9, 2026Updated last month
Alternatives and similar repositories for nepa
Users that are interested in nepa are comparing it to the libraries listed below
Sorting:
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆203Updated this week
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"☆21Sep 3, 2025Updated 6 months ago
- [ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?☆227Dec 15, 2025Updated 3 months ago
- [Arxiv'25] DINO-Tok: Adapting DINO for Visual Tokenizers☆35Nov 25, 2025Updated 3 months ago
- [TPAMI2025] BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors☆15Apr 23, 2025Updated 10 months ago
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders☆223Feb 13, 2026Updated last month
- This is the project for 'USG'.☆37Apr 7, 2025Updated 11 months ago
- ☆23Mar 4, 2026Updated 2 weeks ago
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆35Jun 30, 2025Updated 8 months ago
- ☆81Feb 24, 2026Updated 3 weeks ago
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Feb 28, 2024Updated 2 years ago
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,581Mar 16, 2025Updated last year
- ☆13Jan 14, 2026Updated 2 months ago
- [ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM☆320Mar 2, 2026Updated 2 weeks ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆54May 8, 2025Updated 10 months ago
- [ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation☆57Sep 12, 2025Updated 6 months ago
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆28Feb 16, 2024Updated 2 years ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆42Feb 12, 2025Updated last year
- FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation☆65Jan 13, 2026Updated 2 months ago
- ☆50Sep 26, 2025Updated 5 months ago
- Towards Scalable Pre-training of Visual Tokenizers for Generation☆460Mar 9, 2026Updated last week
- Code for Paper "The Geometry of Reasoning: Flowing Logics in Representation Space" (ICLR 2026)☆46Jan 31, 2026Updated last month
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆53Dec 28, 2025Updated 2 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆426Jun 20, 2025Updated 9 months ago
- [CVPR 2026] DDT: Decoupled Diffusion Transformer☆373Aug 22, 2025Updated 7 months ago
- DepthART official implementation☆24Oct 28, 2024Updated last year
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆330Jun 8, 2025Updated 9 months ago
- [ICCV 2025] Official Implementation of Steering Rectified Flow Models in the Vector Field for Controlled Image Generation☆45Jun 27, 2025Updated 8 months ago
- ☆35Feb 15, 2026Updated last month
- Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.…☆44Jan 27, 2026Updated last month
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆50Mar 20, 2025Updated last year
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆56Feb 2, 2026Updated last month
- ☆26Aug 12, 2025Updated 7 months ago
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆25Jan 22, 2026Updated 2 months ago
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆201Sep 18, 2025Updated 6 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆518Nov 14, 2025Updated 4 months ago
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆173Sep 19, 2025Updated 6 months ago
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆235Jan 22, 2026Updated last month
- ☆50Dec 25, 2025Updated 2 months ago