hao-ai-lab / JacobiForcing
Jacobi Forcing: Fast and Accurate Diffusion-style Decoding
☆147 · Updated last week
Alternatives and similar repositories for JacobiForcing
Users who are interested in JacobiForcing are comparing it to the repositories listed below.
- Efficient implementations of Native Sparse Attention ☆1,044 · Updated 3 months ago
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization ☆448 · Updated 3 weeks ago
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?" ☆159 · Updated last month
- Codebase for the ICCV 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory" ☆125 · Updated 4 months ago
- TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based … ☆841 · Updated last month
- ☆169 · Updated 5 months ago
- [NeurIPS 2025] Native-resolution diffusion Transformer ☆295 · Updated 2 months ago
- ☆100 · Updated 3 weeks ago
- [Survey] Towards Efficient Large Language Model Serving: A Survey on System-Aware KV Cache Optimization ☆230 · Updated last week
- Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion ☆297 · Updated 3 weeks ago
- [ICLR 2025] CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs ☆129 · Updated 7 months ago
- MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement ☆438 · Updated 2 months ago
- The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs ☆101 · Updated this week
- Personalized Fragrance Recommendation for Aromatherapy: A Machine Learning Approach Based on Personality Traits and Electrodermal Activit… ☆14 · Updated 8 months ago
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models ☆145 · Updated 8 months ago
- A lightweight and extensible toolkit for visualizing attention flow in Large Vision-Language Models (LVLMs). It renders token-to-token at… ☆130 · Updated 3 weeks ago
- This is the official repository for C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection ☆157 · Updated 3 months ago
- ☆209 · Updated 8 months ago
- ☆27 · Updated 3 months ago
- ☆111 · Updated 7 months ago
- Syzygy-of-thoughts ☆217 · Updated this week
- Multi-Reward as Condition for Instruction-Based Image Editing ☆57 · Updated 9 months ago
- 🔥 JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization ☆259 · Updated last week
- EvoVLA: Self-Evolving Vision-Language-Action Model ☆219 · Updated last week
- [AAAI 2026] 🔥🔥🔥 FocusDPO: Dynamic Preference Optimization for Multi-Subject Personalized Image Generation via Adaptive Focus ☆378 · Updated 2 months ago
- 🧩 IMAGHarmony 🧩: Controllable image editing with consistent object quantity and layout. A structure-aware framework that ensures high f… ☆678 · Updated 2 months ago
- A Business-Driven Real-World Financial Benchmark for Evaluating LLMs ☆216 · Updated last month
- Paper list on empathy in LMs: theory, modeling, systems, emotion, evaluation ☆83 · Updated 3 weeks ago
- The official repo of the paper "MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly" ☆171 · Updated last month
- MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech ☆271 · Updated last month