mashijie1028/GenHancer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mashijie1028/GenHancer)

mashijie1028 / GenHancer

(ICCV 2025) Enhance CLIP and MLLM's fine-grained visual representations with generative models.

☆78

Alternatives and similar repositories for GenHancer

Users that are interested in GenHancer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jailflip / jailflip-2025
View on GitHub
☆22Jan 9, 2026Updated 6 months ago
GiantAILab / Video-to-Audio-and-Piano
View on GitHub
☆18May 14, 2025Updated last year
mashijie1028 / TrustDD
View on GitHub
(Pattern Recognition 2025) Towards Trustworthy Dataset Distillation
☆14Dec 8, 2024Updated last year
mashijie1028 / Happy-CGCD
View on GitHub
(NeurIPS 2024) Happy: A Debiased Learning Framework for Continual Generalized Category Discovery
☆46Nov 25, 2025Updated 8 months ago
BIT-DA / ABS
View on GitHub
[ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection
☆27Jun 27, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
mashijie1028 / ActiveGCD
View on GitHub
(CVPR 2024) Active Generalized Category Discovery
☆54Oct 17, 2024Updated last year
xihaoCCC / EDA_Data_Science_Job_Market
View on GitHub
Exploring and visualizing the Data Science job market through Glassdoor postings to uncover key insights into industry trends and demands…
☆16Oct 3, 2023Updated 2 years ago
GiantAILab / DeepSound-V1
View on GitHub
Official code for DeepSound-V1
☆12May 14, 2025Updated last year
ltlhuuu / PSEC
View on GitHub
[ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…
☆65Feb 12, 2025Updated last year
Tim-Siu / reinforcement-distillation
View on GitHub
Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"
☆33Jul 25, 2025Updated last year
AuroraZengfh / Local-Prompt
View on GitHub
[ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection
☆52Jul 30, 2025Updated 11 months ago
ZhishanQ / UniHGKR
View on GitHub
The official repository of UniHGKR: Unified Instruction-aware Heterogeneous Knowledge Retrievers
☆27Jun 12, 2025Updated last year
Tencent / HaploVLM
View on GitHub
ICML2025
☆63Aug 28, 2025Updated 10 months ago
CUHK-Shenzhen-SE / UTBoost
View on GitHub
[ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
☆36Aug 12, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NiceRingNode / PartialConvolution
View on GitHub
A non-official re-implementation of article "[ECCV 18] Image Inpainting for Irregular Holes Using Partial Convolutions"
☆12Mar 1, 2025Updated last year
mashijie1028 / ProtoGCD
View on GitHub
(TPAMI 2025) ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery
☆41Jun 13, 2025Updated last year
SaraGhazanfari / EMMA
View on GitHub
EMMA [TMLR 2025]
☆14Sep 25, 2025Updated 10 months ago
TencentARC / ARC-Chapter
View on GitHub
Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
☆44Nov 19, 2025Updated 8 months ago
boyuh / AUCSeg
View on GitHub
This repository is the official code for the paper "AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation" (NeurIPS 2024).
☆14Sep 17, 2025Updated 10 months ago
Ghy0501 / HiDe-LLaVA
View on GitHub
[ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…
☆55Jun 1, 2026Updated last month
TencentARC / Video-Holmes
View on GitHub
[ECCV 2026] Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
☆95Jul 13, 2025Updated last year
YutingLi0606 / Vision-Matters
View on GitHub
(ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning
☆60Sep 30, 2025Updated 9 months ago
OrangeSodahub / InfGen
View on GitHub
[ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.
☆53Aug 27, 2025Updated 10 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
taco-group / LangCoop
View on GitHub
🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language
☆81Sep 12, 2025Updated 10 months ago
hanyang1999 / discrete-diffusion-papers
View on GitHub
A collection of papers on discrete diffusion models
☆164Mar 9, 2026Updated 4 months ago
inFaaa / Awesome-Personalized-Video-Creation
View on GitHub
📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing.
☆64Dec 9, 2025Updated 7 months ago
YS-IMTech / PermaVid
View on GitHub
[Official Code] PermaVid: Consistent Video Generation Across Edits via Disentangled Context Memory
☆43Jun 17, 2026Updated last month
Qiukunpeng / Siamese-Diffusion
View on GitHub
[CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
☆90Nov 29, 2025Updated 7 months ago
eren23 / neo-unify
View on GitHub
Toy-scale unified multimodal model experiments — encoder-free understanding & generation with Mixture-of-Transformers on MLX/Apple Silico…
☆47Mar 8, 2026Updated 4 months ago
NiceRingNode / Awesome-Generative-Models-for-OCR
View on GitHub
[arXiv 25] OCRGenBench: A Comprehensive Benchmark for Evaluating OCR Generative Capabilities
☆273Apr 13, 2026Updated 3 months ago
Bobyue0118 / Constraint-Inference-in-Safe-IRL
View on GitHub
[ICLR 2025] "Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning"
☆16Nov 30, 2025Updated 7 months ago
Chen-GX / C-3PO
View on GitHub
[ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…
☆44May 3, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wusize / OpenUni
View on GitHub
☆189Jun 27, 2025Updated last year
czg1225 / CoDe
View on GitHub
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆108Sep 27, 2025Updated 9 months ago
Haochen-Wang409 / ross
View on GitHub
[ICLR'25] Reconstructive Visual Instruction Tuning
☆135Apr 9, 2025Updated last year
direction-yxf / Hello-GPT
View on GitHub
接地气的大模型工程，争取成为一本大模型实战百科全书
☆16Oct 16, 2023Updated 2 years ago
NuoJohnChen / XtraGPT
View on GitHub
[ACL 2026 Main] XtraGPT: Context-Aware and Controllable Academic Paper Revision via Human-AI Collaboration
☆25Apr 23, 2026Updated 3 months ago
NiceRingNode / LGGPT
View on GitHub
[IJCV 2025] Smaller But Better: Unifying Layout Generation with Smaller Large Language Models
☆158Aug 3, 2025Updated 11 months ago
rekkles2 / Fed_WSVAD
View on GitHub
[IEEE TII 2025] Official Implementation for "Dual-Detector Reoptimization for Federated Weakly Supervised Video Anomaly Detection via Ada…
☆27Nov 11, 2025Updated 8 months ago