[ICLR2026] Advancing End-To-End Pixel-Space Generative Modeling Via Self-Supervised Pre-Training
☆138 · Dec 8, 2025 · Updated 3 months ago
Alternatives and similar repositories for EPG
Users that are interested in EPG are comparing it to the libraries listed below.
- [ICLR26] NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models ☆112 · Jul 28, 2025 · Updated 7 months ago
- [AAAI2026] ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints ☆55 · Oct 23, 2025 · Updated 5 months ago
- Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model. ☆80 · Jun 30, 2025 · Updated 8 months ago
- [ICCV 25] VMBench: A Benchmark for Perception-Aligned Video Motion Generation ☆71 · Oct 10, 2025 · Updated 5 months ago
- [EMNLP25] Official code for "POSITION BIAS MITIGATES POSITION BIAS: Mitigate Position Bias Through Inter-Position Knowledge Distillation… ☆25 · Nov 11, 2025 · Updated 4 months ago
- ☆17 · Mar 25, 2025 · Updated 11 months ago
- [ICLR 2026] Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation ☆122 · Feb 15, 2026 · Updated last month
- [ICCV25] USP: Unified Self-Supervised Pretraining for Image Generation and Understanding ☆92 · Oct 11, 2025 · Updated 5 months ago
- [ICLR26] GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning ☆179 · Jan 29, 2026 · Updated last month
- [ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models ☆121 · Jan 30, 2026 · Updated last month
- [ICCV25] LD-RPS ☆28 · Jul 17, 2025 · Updated 8 months ago
- UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning ☆161 · Jun 2, 2025 · Updated 9 months ago
- [ICLR 2026] Tree Search for LLM Agent Reinforcement Learning ☆306 · Jan 26, 2026 · Updated last month
- [ICLR2026] Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models" ☆152 · Feb 2, 2026 · Updated last month
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing" ☆442 · Nov 24, 2025 · Updated 3 months ago
- Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling [ICCV 2025] Official PyTorch implementation ☆34 · Nov 11, 2025 · Updated 4 months ago
- ☆23 · Jul 20, 2025 · Updated 8 months ago
- IntTravel: A Real-World Dataset and Generative Framework for Integrated Multi-Task Travel Recommendation ☆57 · Feb 18, 2026 · Updated last month
- [TMLR 2026] GIOROM, sampling based model-order reduction for Lagrangian systems ☆19 · Mar 12, 2026 · Updated last week
- ☆17 · Jul 16, 2025 · Updated 8 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆114 · Dec 4, 2025 · Updated 3 months ago
- Controlnet module for Wan2.2 ☆43 · Oct 30, 2025 · Updated 4 months ago
- Meteor trajectory viewer ☆10 · Jan 12, 2026 · Updated 2 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation ☆39 · Aug 3, 2025 · Updated 7 months ago
- Source code of "A structured dictionary perspective on implicit neural representations" ☆61 · Apr 13, 2022 · Updated 3 years ago
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought ☆27 · Feb 14, 2026 · Updated last month
- ☆14 · Dec 7, 2024 · Updated last year
- STeP: a general and scalable framework for solving video inverse problems with spatiotemporal diffusion priors ☆29 · Jun 10, 2025 · Updated 9 months ago
- claw + whip: Event-to-channel notification router — bypasses gateway sessions to avoid context pollution ☆61 · Updated this week
- ☆66 · Feb 1, 2026 · Updated last month
- ☆46 · Mar 12, 2026 · Updated last week
- Parameter-Efficient Fine-Tuning for Geospatial Foundation Models ☆24 · Sep 18, 2025 · Updated 6 months ago
- ☆22 · Nov 18, 2025 · Updated 4 months ago
- ☆18 · Sep 27, 2023 · Updated 2 years ago
- ☆37 · Mar 21, 2025 · Updated last year
- GroundCUA ☆69 · Mar 11, 2026 · Updated last week
- Code to train a Multiple-Input Fourier Neural Operator (MIFNO) to predict the solution of 3D source-dependent Partial Differential Equati… ☆12 · Jan 11, 2026 · Updated 2 months ago
- ☆309 · May 29, 2025 · Updated 9 months ago
- [NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model that can accurately describe and rate image quality. ☆160 · Oct 15, 2025 · Updated 5 months ago