[ICLR 2026] Code for our paper "Next Visual Granularity Generation".
☆53Jan 26, 2026Updated 4 months ago
Alternatives and similar repositories for nvg
Users that are interested in nvg are comparing it to the libraries listed below.
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆119Nov 3, 2025Updated 6 months ago
- This repository contains the code for the IEEE Robotics and Automation Letters paper "Open-Set Object Detection Using Classification-Free…☆14Dec 6, 2023Updated 2 years ago
- Code for AAAI2024 paper: Towards Evidential and Class Separable Open Set Object Detection☆12Dec 23, 2023Updated 2 years ago
- ☆16Sep 1, 2025Updated 8 months ago
- Codebase for the paper "Elucidating the design space of language models for image generation"☆46Nov 17, 2024Updated last year
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆44Sep 30, 2024Updated last year
- ICLR 2026-MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning☆37Apr 17, 2026Updated last month
- ☆35Nov 17, 2025Updated 6 months ago
- [ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…☆97Jan 26, 2026Updated 4 months ago
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Mar 20, 2025Updated last year
- ☆20Dec 8, 2024Updated last year
- ☆66Jul 11, 2025Updated 10 months ago
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper☆96Nov 13, 2025Updated 6 months ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 7 months ago
- T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation☆35Sep 16, 2025Updated 8 months ago
- Explaining audio differences using language☆16Feb 11, 2025Updated last year
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"☆55Apr 3, 2026Updated last month
- [ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?☆53Mar 9, 2026Updated 2 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆38Feb 11, 2025Updated last year
- (IEEE TCSVT) 3DPortraitGAN: Learning One-Quarter Headshot 3D GANs from a Single-View Portrait Dataset with Diverse Body Poses☆33Jul 9, 2025Updated 10 months ago
- [MICCAI 2024] Embracing Massive Medical Data☆20Jul 5, 2024Updated last year
- The official codebase for Reflected Flow Matching (ICML 2024)☆23Jun 19, 2024Updated last year
- [ICLR2026] Video-GPT via Next Clip Diffusion.☆45Jun 2, 2025Updated 11 months ago
- SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style Transfer☆48Nov 10, 2025Updated 6 months ago
- This is the official PyTorch implementation of the paper "Rethinking Re-Sampling in Imbalanced Semi-Supervised Learning" (Ju He, Adam Kor…☆25Nov 18, 2021Updated 4 years ago
- [CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).☆144Oct 1, 2025Updated 7 months ago
- ☆58Dec 16, 2024Updated last year
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆108Sep 27, 2025Updated 8 months ago
- This repo provides the codebase for "A General Framework for Weak Supervision"☆40Jun 3, 2024Updated last year
- HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering (CVPR'23)☆14Nov 4, 2025Updated 6 months ago
- Text-guided 3D texture generation using training-free multi-diffusion in UV space.☆14Apr 7, 2025Updated last year
- DiP: Taming Diffusion Models in Pixel Space☆59May 13, 2026Updated 2 weeks ago
- ☆15Dec 16, 2023Updated 2 years ago
- VideoAuteur: Towards Long Narrative Video Generation☆44Oct 22, 2025Updated 7 months ago
- [ICML 2026] LaST$_0$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model☆70Apr 30, 2026Updated 3 weeks ago
- [NeurIPS'25] InstructRestore: Region-Customized Image Restoration with Human Instructions☆50Oct 23, 2025Updated 7 months ago
- We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enh…☆17Dec 31, 2024Updated last year