HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing
☆129Mar 7, 2026Updated this week
Alternatives and similar repositories for HY-WU
Users that are interested in HY-WU are comparing it to the libraries listed below
Sorting:
- [CVPR 2026] Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration☆66Updated this week
- Official implementation of "LoFA: Learning to Predict Personalized Prior for Fast Adaptation of Visual Generative Models".☆35Feb 1, 2026Updated last month
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆115Jul 9, 2025Updated 8 months ago
- Long-range camera-conditioned scene generation from one single image.☆105Dec 23, 2025Updated 2 months ago
- JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers☆17Jul 21, 2025Updated 7 months ago
- MMD viewer powered by Babylon.js and babylon-mmd☆16Aug 2, 2025Updated 7 months ago
- ☆44Nov 26, 2025Updated 3 months ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆81Nov 16, 2025Updated 3 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆130May 22, 2025Updated 9 months ago
- [SIGGRAPH Asia 2025] Official github repo of SeqTex, an end-to-end 3D texture generation method using video diffusion priors.☆39Dec 12, 2025Updated 2 months ago
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆91Feb 7, 2026Updated last month
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Jan 7, 2026Updated 2 months ago
- [ICCV 2025] CompleteMe: Reference-based Human Image Completion☆26Jan 20, 2026Updated last month
- Multimodal RewardBench☆62Feb 21, 2025Updated last year
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆46Nov 12, 2025Updated 3 months ago
- ☆43Sep 1, 2025Updated 6 months ago
- A curated list of papers and resources for text-to-image evaluation.☆30Sep 6, 2023Updated 2 years ago
- Motion programming language☆50Feb 10, 2026Updated 3 weeks ago
- Research on training an LLM with DeepSeek & Kimi architecture☆40Sep 30, 2025Updated 5 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆245Aug 15, 2025Updated 6 months ago
- The official implementation of Recurrent Diffusion for Large-Scale Parameter Generation.☆78Sep 24, 2025Updated 5 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆111Oct 10, 2024Updated last year
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆32Dec 7, 2023Updated 2 years ago
- Fine-tuned LLMs generate accurate 3D human avatars from textual descriptions using the SMPL-X model, enhancing customization and simulati…☆37Feb 5, 2025Updated last year
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Dec 27, 2024Updated last year
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆902Aug 27, 2025Updated 6 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆670Oct 14, 2025Updated 4 months ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated 11 months ago
- a unified reinforcement learning toolbox for joint RL on language models and diffusion models☆75Feb 7, 2026Updated last month
- [ICCV2023] Dataset Quantization☆263Jan 6, 2024Updated 2 years ago
- [CVPR 2026] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding☆76Feb 22, 2026Updated 2 weeks ago
- [NeurIPS 2025] ARMesh: Autoregressive Mesh Generation via Next-Level-of-Detail Prediction☆61Jan 27, 2026Updated last month
- ☆77May 4, 2025Updated 10 months ago
- [🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …☆648Feb 27, 2026Updated last week
- The official repo of continuous speculative decoding☆31Mar 28, 2025Updated 11 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 5 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆219Oct 12, 2025Updated 4 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Sep 12, 2023Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago