☆30Mar 30, 2025Updated last year
Alternatives and similar repositories for V2Flow
Users that are interested in V2Flow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation for the CVPR 2024 paper CAMEL☆20Jun 20, 2024Updated last year
- ☆19Apr 1, 2025Updated last year
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing☆17Mar 16, 2025Updated last year
- ☆28Apr 25, 2025Updated 11 months ago
- A curated list of papers and resources for text-to-image evaluation.☆30Sep 6, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆86Jul 13, 2024Updated last year
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 6 months ago
- [CVPR 2025] Official repository for "From Poses to Identity: Training-Free Person Re-Identification via Feature Centralization"☆149Dec 3, 2025Updated 4 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 3 months ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆48Jul 1, 2025Updated 9 months ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆65Aug 6, 2025Updated 8 months ago
- ☆12Oct 5, 2024Updated last year
- Code for paper "Masked Pre-training Enables Universal Zero-shot Denoiser" [NeurIPS 2024].☆35Nov 20, 2024Updated last year
- Explore how to get a VQ-VAE models efficiently!☆69Jul 24, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆248Oct 12, 2025Updated 6 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆44Mar 11, 2025Updated last year
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆176Mar 18, 2026Updated 3 weeks ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated 2 years ago
- ICML2025☆64Aug 28, 2025Updated 7 months ago
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- [CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space☆26Mar 15, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆29Apr 15, 2025Updated 11 months ago
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Sep 24, 2025Updated 6 months ago
- [SIGGRAPH 2025] AssetDropper: Asset Extraction via Diffusion Models with Reward-Driven Optimization☆34Jun 19, 2025Updated 9 months ago
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆18Sep 15, 2025Updated 6 months ago
- Official repository for HOComp: Interaction-Aware Human-Object Composition☆30Dec 3, 2025Updated 4 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆166Jan 31, 2025Updated last year
- Source code for EAC-Net in Theano/Pytorch/Tensorflow☆20Jan 16, 2018Updated 8 years ago
- We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Models on chart-to-…☆25Jan 27, 2026Updated 2 months ago
- [ACMMM 2025] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆22Jun 20, 2025Updated 9 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)☆11Oct 12, 2022Updated 3 years ago
- ☆12Jun 12, 2024Updated last year
- Support code for controlnet diffuser step of "A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis", EGSR…☆13Sep 20, 2024Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated 11 months ago
- [TCSVT 2025] Official code release of our paper "Towards Explainable Image Aesthetics Assessment With Attribute-Oriented Critiques Genera…☆23Oct 26, 2025Updated 5 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆73Jul 13, 2025Updated 9 months ago
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆155Jul 24, 2025Updated 8 months ago