Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"
☆138Oct 17, 2025Updated 4 months ago
Alternatives and similar repositories for UniFlow
Users that are interested in UniFlow are comparing it to the libraries listed below
- Prediction under extremely imbalanced samples☆40Oct 28, 2025Updated 4 months ago
- A navigation algorithm based on CMU team's open-source local planner☆118Oct 9, 2025Updated 5 months ago
- [ICDAR-DALL-2025] PALM-LAY is the first unified, cross-regional annotated dataset specifically designed for layout analysis of historical…☆40Dec 30, 2025Updated 2 months ago
- A JetBrains plugin: Tree Description. It adds custom notes to project modules, with color categories and usage labels, and can share open-source mappings.☆212Jan 26, 2026Updated last month
- An open-source AI command-line tool that brings multi-model AI agents, intelligent workflows, and spec-driven development to your terminal.☆121Nov 23, 2025Updated 3 months ago
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders☆212Feb 13, 2026Updated 3 weeks ago
- Astron-xmod-shim — Lightweight, declarative middleware for reliably converging AI service workloads.☆100Nov 3, 2025Updated 4 months ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆275Oct 29, 2025Updated 4 months ago
- (NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation☆66Oct 14, 2025Updated 4 months ago
- flex-block-attn: an efficient block sparse attention computation library☆124Dec 26, 2025Updated 2 months ago
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆33Dec 27, 2025Updated 2 months ago
- [ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?☆222Dec 15, 2025Updated 2 months ago
- ☆72Oct 18, 2025Updated 4 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 3 months ago
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices☆95Nov 30, 2025Updated 3 months ago
- Official repository for ReasonGen-R1☆75Jun 23, 2025Updated 8 months ago
- Text2GraphRAG Disease Assistant builds a disease-focused retrieval-augmented generation workflow. It ingests structured Markdown (demo: o…☆42Nov 20, 2025Updated 3 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Jan 7, 2026Updated 2 months ago
- Code from blog 'Searching by Music: Leveraging Vector Search for Music Information Retrieval'☆16Nov 16, 2023Updated 2 years ago
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh…☆19Dec 30, 2025Updated 2 months ago
- Estimating musical surprisal/information content in Audio☆23Jan 19, 2026Updated last month
- The config panel for ai sdk.☆96Nov 2, 2025Updated 4 months ago
- ☆47Apr 20, 2025Updated 10 months ago
- ☆140Feb 13, 2026Updated 3 weeks ago
- [ICCV25] USP: Unified Self-Supervised Pretraining for Image Generation and Understanding☆92Oct 11, 2025Updated 4 months ago
- ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models☆18Aug 9, 2024Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆79Oct 31, 2024Updated last year
- ☆21Jan 17, 2025Updated last year
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- ☆28Mar 4, 2025Updated last year
- Toward Generalizing Visual Brain Decoding to Unseen Subjects☆28May 14, 2025Updated 9 months ago
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆31Jan 13, 2026Updated last month
- [Preprint] UCGM: Unified Continuous Generative Models☆182May 27, 2025Updated 9 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆287Dec 4, 2024Updated last year
- High-performance Image Tokenizers for VAR and AR☆303Apr 25, 2025Updated 10 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆245Aug 15, 2025Updated 6 months ago
- [NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆727Nov 27, 2025Updated 3 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆32Mar 26, 2025Updated 11 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆58Dec 26, 2025Updated 2 months ago