[CVPR 2026 Oral] Pixel Diffusion Transformers for Image Generation
★70 · Apr 9, 2026 · Updated last week
Alternatives and similar repositories for PixelDiT
Users that are interested in PixelDiT are comparing it to the libraries listed below.
- ★12 · Jul 18, 2024 · Updated last year
- [CVPR 2026 Highlight 🔥] MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE ★152 · Apr 9, 2026 · Updated last week
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing ★44 · Jan 9, 2026 · Updated 3 months ago
- ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models ★18 · Aug 9, 2024 · Updated last year
- The code for "Toward Accurate and Temporally Consistent Video Restoration from Raw Data" ★16 · Dec 25, 2023 · Updated 2 years ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow ★27 · Apr 9, 2025 · Updated last year
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering ★46 · Oct 15, 2025 · Updated 6 months ago
- FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023) ★24 · Sep 24, 2023 · Updated 2 years ago
- A modern, responsive academic personal website. ★22 · Apr 5, 2025 · Updated last year
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024 ★31 · Jul 19, 2024 · Updated last year
- [CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation ★319 · Dec 15, 2025 · Updated 4 months ago
- [ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and… ★36 · Jul 26, 2024 · Updated last year
- Simple reimplementation of Flow Matching for Generative Modeling (https://arxiv.org/abs/2210.02747) paper in PyTorch ★23 · Aug 10, 2024 · Updated last year
- Repo for USENIX security 2024 paper "On Data Fabrication in Collaborative Vehicular Perception: Attacks and Countermeasures" https://arxi… ★21 · Oct 19, 2025 · Updated 6 months ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ★44 · Aug 9, 2025 · Updated 8 months ago
- [NeurIPS 2025 Spotlight] Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object Detection ★39 · Jan 15, 2026 · Updated 3 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners. ★21 · Sep 24, 2025 · Updated 6 months ago
- BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation (CVPR2024) ★13 · Jul 11, 2024 · Updated last year
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge… ★11 · Jul 28, 2025 · Updated 8 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral). ★102 · Feb 11, 2025 · Updated last year
- ★62 · May 27, 2025 · Updated 10 months ago
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations ★22 · Dec 24, 2025 · Updated 3 months ago
- ★34 · Jul 24, 2025 · Updated 8 months ago
- Official Implementation of pMF https://arxiv.org/abs/2601.22158 ★213 · Feb 19, 2026 · Updated last month
- A tiny package supporting distributed computation of COCO metrics for PyTorch models. ★15 · Feb 28, 2023 · Updated 3 years ago
- ★35 · May 27, 2024 · Updated last year
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning ★14 · Dec 30, 2024 · Updated last year
- [NeurIPS 2024] DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering ★12 · Oct 22, 2024 · Updated last year
- ★17 · Jul 24, 2025 · Updated 8 months ago
- Official PyTorch codes for "Enhancing Diffusion Models with Text-Encoder Reinforcement Learning", ECCV2024 ★58 · Aug 13, 2024 · Updated last year
- Make self forcing endless. Add cache purging. Add prompt controllability. ★70 · Sep 9, 2025 · Updated 7 months ago
- Code release for "A New Benchmark: On the Utility of Synthetic Data with Blender for Bare Supervised Learning and Downstream Domain Adapt… ★13 · Mar 15, 2024 · Updated 2 years ago
- A Multitask Conversational Vision-Language Model for Radiology ★16 · Jul 3, 2025 · Updated 9 months ago
- Code release for "MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos" (CVPR2023) ★14 · Dec 14, 2023 · Updated 2 years ago
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models ★129 · Jan 23, 2026 · Updated 2 months ago
- ICML2025 ★64 · Aug 28, 2025 · Updated 7 months ago
- Code release for "BoxVIS: Video Instance Segmentation with Box Annotation" ★12 · Dec 22, 2023 · Updated 2 years ago
- A data collection and processing pipeline for animal video, annotations include mask, keypoint, depth, occlusion, etc. Suitable for 3D/4D… ★54 · Dec 5, 2025 · Updated 4 months ago
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling ★13 · May 5, 2025 · Updated 11 months ago