☆14Jul 5, 2024Updated last year
Alternatives and similar repositories for MPS
Users that are interested in MPS are comparing it to the libraries listed below
Sorting:
- ☆13Jan 22, 2025Updated last year
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 8 months ago
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- ☆15Mar 30, 2025Updated 11 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆153Jun 25, 2024Updated last year
- ☆84Oct 10, 2025Updated 4 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated last year
- [TMM] MINT-IQA: Quality Assessment for AI Generated Images with Instruction Tuning☆20Nov 21, 2025Updated 3 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆46Nov 25, 2025Updated 3 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆32Mar 26, 2025Updated 11 months ago
- ☆27May 23, 2025Updated 9 months ago
- Official pytorch implementation for SingleInsert☆28Apr 19, 2024Updated last year
- [MM 2024 Oral] Refiner for AIGC☆29Jul 29, 2024Updated last year
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation☆31Apr 3, 2024Updated last year
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆84Jul 4, 2024Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆35Mar 12, 2024Updated last year
- ☆21Dec 14, 2025Updated 2 months ago
- Test-Time Distribution Normalization For Contrastively Learned Vision-language Models☆27Jan 15, 2024Updated 2 years ago
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Apr 9, 2024Updated last year
- [ACL 2025] The official pytorch implement of "MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection".☆26May 26, 2025Updated 9 months ago
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Jun 11, 2024Updated last year
- ☆34Dec 29, 2025Updated 2 months ago
- A free and open-source focus stacking software that supports multi-focus image alignment and fusion.☆19Feb 5, 2026Updated 3 weeks ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 11 months ago
- InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥☆40Aug 8, 2024Updated last year
- DREAM: Diffusion Rectification and Estimation-Adaptive Models (CVPR 2024)☆41Feb 3, 2025Updated last year
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆116Oct 7, 2025Updated 4 months ago
- Code for Continuously Changing Corruptions (CCC) benchmark + evaluation☆41Aug 21, 2024Updated last year
- ☆36Dec 19, 2022Updated 3 years ago
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.☆48Nov 3, 2025Updated 3 months ago
- [ISBI 2024] Official PyTorch implementation of Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Seg…☆11Aug 12, 2024Updated last year
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO☆92Dec 1, 2025Updated 3 months ago
- ☆43Dec 1, 2025Updated 2 months ago
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆24Dec 4, 2025Updated 2 months ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆29Sep 12, 2025Updated 5 months ago
- ☆13Aug 28, 2024Updated last year
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆79Oct 29, 2025Updated 4 months ago