☆27 · Mar 3, 2025 · Updated last year
Alternatives and similar repositories for wyd-benchmark
Users who are interested in wyd-benchmark are comparing it to the repositories listed below.
- ☆13 · Jul 10, 2024 · Updated last year
- ☆15 · Mar 30, 2025 · Updated 11 months ago
- Transferring Genshin PVs into a freehand style with Diffusion Model. ☆10 · Jun 5, 2024 · Updated last year
- [CVPR 2025] GPS as a Control Signal for Image Generation ☆25 · Mar 18, 2025 · Updated last year
- Code for "Kuramoto Orientation Diffusion" ☆27 · Nov 7, 2025 · Updated 4 months ago
- ☆34 · Dec 16, 2025 · Updated 3 months ago
- ☆25 · Aug 19, 2025 · Updated 7 months ago
- The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling" ☆30 · Jul 7, 2025 · Updated 8 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer ☆16 · Sep 7, 2024 · Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception ☆14 · Jul 4, 2025 · Updated 8 months ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H… ☆21 · Jul 26, 2025 · Updated 7 months ago
- ☆12 · Jul 27, 2024 · Updated last year
- repo for PIANO: A Parametric Hand Bone Model from Magnetic Resonance Imaging ☆27 · Jul 11, 2024 · Updated last year
- InstructG2I: Synthesizing Images from Multimodal Attributed Graphs (NeurIPS 2024) ☆20 · Oct 17, 2024 · Updated last year
- ☆18 · Jan 19, 2026 · Updated 2 months ago
- Consistent Human Image and Video Generation with Spatially Conditioned Diffusion ☆16 · Sep 1, 2025 · Updated 6 months ago
- Official implementation of “ACE: Anti-Editing Concept Erasure in Text-to-Image Models” ☆14 · Jan 5, 2026 · Updated 2 months ago
- GUESS: GradUally Enriching SyntheSis for Text-Driven Human Motion Generation (IEEE Transactions on Visualization and Computer Graphics, … ☆34 · Jan 29, 2024 · Updated 2 years ago
- ☆29 · Mar 24, 2025 · Updated 11 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers ☆79 · Jul 29, 2025 · Updated 7 months ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation ☆32 · Mar 10, 2026 · Updated last week
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space ☆39 · Oct 16, 2025 · Updated 5 months ago
- Developer project for getting basic API integrations working in under 5 minutes ☆11 · Jan 30, 2026 · Updated last month
- ☆13 · May 17, 2025 · Updated 10 months ago
- ☆34 · Feb 16, 2025 · Updated last year
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo… ☆17 · Jun 12, 2024 · Updated last year
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L… ☆57 · Mar 4, 2024 · Updated 2 years ago
- Semi-Supervised Fine-Grained Recognition Challenge at FGVC8 ☆29 · Nov 24, 2021 · Updated 4 years ago
- ☆17 · Apr 17, 2025 · Updated 11 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- [CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis ☆22 · Mar 23, 2025 · Updated 11 months ago
- CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning ☆28 · Feb 11, 2026 · Updated last month
- ☆17 · May 29, 2024 · Updated last year
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni… ☆15 · Oct 31, 2025 · Updated 4 months ago
- Use Blender for figures. ☆15 · Feb 11, 2026 · Updated last month
- Official repository for "PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation" (CVP… ☆19 · Nov 12, 2025 · Updated 4 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation ☆23 · Jul 30, 2025 · Updated 7 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval ☆16 · Jul 6, 2024 · Updated last year
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation" ☆22 · Jun 5, 2025 · Updated 9 months ago