Aitchson-Hwang / MNetLinks
[Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."
☆17Updated 7 months ago
Alternatives and similar repositories for MNet
Users that are interested in MNet are comparing it to the libraries listed below
Sorting:
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆74Updated 3 months ago
- UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation☆123Updated last month
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…☆106Updated last month
- [NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model can accurately describe and rate the image quality.☆153Updated 3 months ago
- ☆26Updated last year
- Official code of SmartEdit [CVPR-2024 Highlight]☆370Updated last year
- This is a collection of recent papers on reasoning in video generation models.☆95Updated last month
- Official implementation of NeurIPS'24 paper Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features☆38Updated 8 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆105Updated 9 months ago
- [NeurIPS 2024] Official Code for the Paper "Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning"☆26Updated 10 months ago
- ☆48Updated last year
- [ICCV 2025] Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation☆25Updated 5 months ago
- Q-Insight Family: Q-Insight, VQ-Insight and RALI (NeurIPS 2025 Spotlight, AAAI 2026 Oral, and ICLR 2026 Oral)☆246Updated this week
- ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding(书生 · 妙析多模态美学理解大模型)☆123Updated 3 weeks ago
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning☆270Updated 9 months ago
- 🚀 Cross attention map tools for huggingface/diffusers☆388Updated last week
- ☆66Updated last week
- Q-Insight is open-sourced at https://github.com/bytedance/Q-Insight. This repository will not receive further updates.☆142Updated 8 months ago
- Record some basic training on the stable diffusion series, including Lora, Controlnet, IP-adapter, and a bit of fun AIGC play!☆47Updated last year
- A curated list of awesome Multimodal studies.☆312Updated last month
- [CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution☆220Updated last month
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆114Updated 7 months ago
- The code repository of Adv-GRPO☆67Updated last month
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆140Updated this week
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆79Updated 2 months ago
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆214Updated 10 months ago
- Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirect…☆208Updated 9 months ago
- Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"☆90Updated 5 months ago
- [CVPR 2025] DreamRelation: Bridging Customization and Relation Generation☆19Updated last month
- [ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…☆361Updated last week