SihanXU/nepa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SihanXU/nepa)

SihanXU / nepa

PyTorch implementation of NEPA

☆338

Alternatives and similar repositories for nepa

Users that are interested in nepa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

End2End-Diffusion / iREPA
View on GitHub
[ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?
☆256Dec 15, 2025Updated 7 months ago
ZitengWangNYU / Scale-RAE
View on GitHub
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
☆255Feb 13, 2026Updated 5 months ago
bytetriper / RAE
View on GitHub
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,977Feb 25, 2026Updated 4 months ago
paraynaud / MTH8408-Hiv24
View on GitHub
☆15Mar 25, 2024Updated 2 years ago
facebookresearch / pixio
View on GitHub
[CVPR 2026] Pixio: a capable vision encoder dedicated to dense prediction, simply by pixel reconstruction
☆457Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
MiniMax-AI / VTP
View on GitHub
[ECCV 2026] Towards Scalable Pre-training of Visual Tokenizers for Generation
☆495Apr 15, 2026Updated 3 months ago
LTH14 / JiT
View on GitHub
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
☆2,462Dec 8, 2025Updated 7 months ago
facebookresearch / webssl
View on GitHub
Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).
☆214Mar 20, 2026Updated 4 months ago
facebookresearch / tuna-2
View on GitHub
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆738Updated this week
cambrian-mllm / cambrian-s
View on GitHub
Cambrian-S: Towards Spatial Supersensing in Video
☆561Apr 3, 2026Updated 3 months ago
EvolvingLMMs-Lab / OneVision-Encoder
View on GitHub
Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence
☆385Jun 20, 2026Updated last month
google-deepmind / representations4d
View on GitHub
☆180Jun 8, 2026Updated last month
lillian039 / VARC
View on GitHub
☆246Nov 26, 2025Updated 7 months ago
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529Nov 14, 2025Updated 8 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
G-U-N / consolver
View on GitHub
[CVPR 2026 (Highlight)] Unofficial Implementation of "Image Diffusion Preview with Consistency Solver"
☆30Jan 24, 2026Updated 5 months ago
ShivamDuggal4 / UNITE-tokenization-generation
View on GitHub
Single-stage End-to-End Training for Tokenization and Generation
☆117Mar 24, 2026Updated 3 months ago
akhileshthite / zipify-tunes
View on GitHub
Convert any playlist CSVs into MP3 files with metadata, bring back your MP3 player!
☆19Jul 4, 2026Updated 2 weeks ago
tensake / litehook
View on GitHub
Lightweight social media monitoring tool built with Rust
☆17Jun 10, 2026Updated last month
JackXing875 / NeneBot
View on GitHub
綾地寧々は世界一可愛い！
☆16Jul 14, 2026Updated last week
sihyun-yu / REPA
View on GitHub
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,679Mar 16, 2025Updated last year
QitaoZhao / E-RayZer
View on GitHub
[CVPR 2026] "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.
☆301May 30, 2026Updated last month
End2End-Diffusion / REPA-E
View on GitHub
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
☆511Dec 6, 2025Updated 7 months ago
Echo-Team-Joy-Future-Academy-JD / Echo-Memory
View on GitHub
A Simple Baseline for Video World Models with Memory
☆220Jul 11, 2026Updated last week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
littlepure2333 / C4D
View on GitHub
[ICCV 2025] C4D: 4D Made from 3D through Dual Correspondences
☆25Oct 17, 2025Updated 9 months ago
Belyenochi / openclaw-edd
View on GitHub
Evaluation-Driven Development for OpenClaw agents — mine golden cases from real sessions, catch regressions before they ship.
☆18Mar 17, 2026Updated 4 months ago
hustvl / VGT
View on GitHub
Visual Generation Tuning
☆101Apr 16, 2026Updated 3 months ago
hustvl / LightningDiT
View on GitHub
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,508Dec 16, 2025Updated 7 months ago
facebookresearch / vjepa2
View on GitHub
PyTorch code and models for VJEPA2 self-supervised learning from video.
☆4,376Mar 23, 2026Updated 3 months ago
vision-x-nyu / pisa-experiments
View on GitHub
Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)
☆59May 8, 2025Updated last year
vishnu97770 / VELOTYPE
View on GitHub
Adaptive AI-powered typing practice system that analyzes repeated user mistakes and generates personalized corrective tasks using FastAPI…
☆16Jun 6, 2026Updated last month
SnowCharmQ / DEP
View on GitHub
[2025 EMNLP Main (oral)] Latent Inter-User Difference Modeling for LLM Personalization
☆17Sep 16, 2025Updated 10 months ago
PKU-YuanGroup / UniSandBox
View on GitHub
Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward
☆60Nov 27, 2025Updated 7 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
assafbk / OPRM
View on GitHub
Overflow Prevention Enhances Long-Context Recurrent LLMs (COLM 2025)
☆18Jul 8, 2025Updated last year
jaredrummler / consoul
View on GitHub
A beautiful terminal-based AI chat interface built with Textual and LangChain
☆15Jan 7, 2026Updated 6 months ago
ErikZ719 / CoTA
View on GitHub
[ICLR 26] Context Tokens are Anchors: Understanding the Repeat Curse in dMLLMs from an Information Flow Perspective
☆16Mar 6, 2026Updated 4 months ago
likaiucas / DragOSM
View on GitHub
TPAMI Underreview paper: DragOSM
☆19Feb 26, 2026Updated 4 months ago
shiml20 / SVG
View on GitHub
[ICLR 2026] Official PyTorch Implementation of "Latent Diffusion Model Without Variational Autoencoder".
☆457Dec 15, 2025Updated 7 months ago
Jivoronix / blockchain-data-validator
View on GitHub
☆15Jan 31, 2025Updated last year
amazon-far / BAR
View on GitHub
[ICML 2026] code & model for arxiv paper "Autoregressive Image Generation with Masked Bit Modeling"
☆59May 1, 2026Updated 2 months ago