jiyt17/ReDiff

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jiyt17/ReDiff)

jiyt17 / ReDiff

Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model'

☆45

Alternatives and similar repositories for ReDiff

Users that are interested in ReDiff are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qishisuren123 / AnyCap
View on GitHub
A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…
☆54Jul 24, 2025Updated 11 months ago
ChartMimic / ChartMimic
View on GitHub
[ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation
☆132Dec 19, 2025Updated 7 months ago
Multimedia-Analytics-Laboratory / dpdmd
View on GitHub
[ICML 2026] The offical code of Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis
☆87Jun 2, 2026Updated last month
SihengLi99 / LLM-Honesty-Survey
View on GitHub
[2025-TMLR] A Survey on the Honesty of Large Language Models
☆66Dec 8, 2024Updated last year
zss02 / BiPS
View on GitHub
[CVPR 2026] See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning
☆21Jun 28, 2026Updated 3 weeks ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
VINHYU / OpenSpatial
View on GitHub
☆92May 8, 2026Updated 2 months ago
cychomatica / FreeDave
View on GitHub
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
☆23May 19, 2026Updated 2 months ago
qishisuren123 / S2L-PO
View on GitHub
[ICML 2026] Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO
☆19Jun 15, 2026Updated last month
maomaocun / dLLM-Var
View on GitHub
The official implementation of dLLM-Var
☆35Nov 6, 2025Updated 8 months ago
alchemistyzz / PeRL
View on GitHub
[NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"
☆30Mar 30, 2026Updated 3 months ago
IIGROUP / AutoIE2
View on GitHub
[NLPCC 2021] Shared Task on AutoIE2: Sub-Event Identification
☆14Jul 19, 2021Updated 5 years ago
TianHongZXY / qaap
View on GitHub
[EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions
☆12Dec 18, 2023Updated 2 years ago
QuenithAI / Diffusion-Large-Language-Models-Paper-List
View on GitHub
Tracking the latest and greatest research papers on diffusion large language models.
☆32Mar 13, 2026Updated 4 months ago
jiyt17 / Prompt-A-Video
View on GitHub
[ICCV 2025] Prompt-A-Video
☆24Feb 2, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zsgvivo / VideoZoomer
View on GitHub
☆34Feb 12, 2026Updated 5 months ago
yang3121099 / LLM-Neo
View on GitHub
The code for paper "LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models"
☆15Mar 2, 2025Updated last year
Cominclip / OmniVerifier
View on GitHub
[ICLR 2026 Oral & ICML 2026] Generative Universal Verifier as Multimodal Meta-Reasoner
☆64May 29, 2026Updated last month
czg1225 / dParallel
View on GitHub
[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆65Apr 12, 2026Updated 3 months ago
aim-uofa / dLLM-MidTruth
View on GitHub
[ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".
☆66Mar 5, 2026Updated 4 months ago
Shuweis / ResMaster
View on GitHub
☆63Jun 25, 2024Updated 2 years ago
IIGROUP / AttentionProbe
View on GitHub
[ICASSP 2022] Official PyTorch Implementation for "Attention Probe: Vision Transformer Distillation in the Wild" (ICASSP 2022)
☆11Jan 23, 2022Updated 4 years ago
MrZilinXiao / AutoVER
View on GitHub
[ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.
☆14Mar 2, 2024Updated 2 years ago
ToolBeHonest / ToolBeHonest
View on GitHub
[EMNLP 2024] A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models.
☆22Sep 23, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
SihengLi99 / RePO
View on GitHub
RePO: Replay-Enhanced Policy Optimization
☆23Jun 12, 2025Updated last year
microsoft / PixelCraft
View on GitHub
[ICLR 2026] High-Fidelity Visual Reasoning on Structured Images
☆29Updated this week
KirigiriSuzumiya / Phone_det
View on GitHub
基于paddlex目标检测的工业场景下违规使用手机识别。
☆12Jun 11, 2022Updated 4 years ago
jacklishufan / LaViDa
View on GitHub
Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
☆227Dec 17, 2025Updated 7 months ago
congvvc / LaSagnA
View on GitHub
Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".
☆63Apr 29, 2024Updated 2 years ago
JianyuanZhong / StableDRL
View on GitHub
☆15Updated this week
ML-GSAI / ESPO
View on GitHub
Official PyTorch implementation for "Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective"
☆39Jan 25, 2026Updated 5 months ago
TianheWu / Assessor360
View on GitHub
[NeurIPS 2023] Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment
☆38Oct 11, 2023Updated 2 years ago
ali-vilab / Wan-Move
View on GitHub
[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
☆644Jan 5, 2026Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
martian422 / MaskGRPO
View on GitHub
The official implementation of MaskGRPO: Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models. (ICLR 2026, arxiv…
☆19Jan 27, 2026Updated 5 months ago
Xilluill / MAG
View on GitHub
Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation
☆18Mar 20, 2026Updated 4 months ago
DavidFanzz / SCMoE
View on GitHub
☆29May 24, 2024Updated 2 years ago
IIGROUP / MAP
View on GitHub
☆38Oct 11, 2022Updated 3 years ago
Candice-yu / GeoLaux
View on GitHub
A Benchmark for Evaluating MLLMs' Geometry Performance on Long-Step Problems Requiring Auxiliary Lines
☆38Apr 27, 2026Updated 2 months ago
jacklishufan / Reflect-DiT
View on GitHub
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
☆56Aug 16, 2025Updated 11 months ago
NIneeeeeem / LangDC
View on GitHub
[EMNLP 2025 Oral] Official codebase for Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors.
☆18Sep 7, 2025Updated 10 months ago