gszfwsb / NCFM

Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in CVPR 2025 (Highlight).

☆375

Alternatives and similar repositories for NCFM

Users who are interested in NCFM are comparing it to the repositories listed below.

JunyaoHu / academic-project-page-template-vue
A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue
☆273 Updated 2 weeks ago
juzhengz / LoRI
[COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation
☆132 Updated this week
JackYFL / awesome-VLLMs
This repository collects papers on VLLM applications. We will update new papers irregularly.
☆145 Updated last month
LeiyiHU / mona
The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".
☆335 Updated 2 weeks ago
dvlab-research / Seg-Zero
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
☆452 Updated last month
dvlab-research / VisionReasoner
The official implementation of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"
☆222 Updated last month
ZJU-LLMs / Awesome-LoRAs
☆178 Updated 11 months ago
HITsz-TMG / Awesome-Large-Multimodal-Reasoning-Models
The development and future prospects of multimodal reasoning models.
☆431 Updated last week
AIDC-AI / Awesome-Unified-Multimodal-Models
Awesome Unified Multimodal Models
☆414 Updated last week
Visual-Agent / DeepEyes
☆596 Updated last week
zhengxuJosh / Awesome-RAG-Vision
Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision
☆192 Updated last week
swordlidev / Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey
☆360 Updated 2 months ago
yaotingwangofficial / Awesome-MCoT
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
☆695 Updated 2 weeks ago
OpenGVLab / Vision-RWKV
[ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
☆477 Updated 4 months ago
Clin0212 / HydraLoRA
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
☆216 Updated 7 months ago
LMM101 / Awesome-Multimodal-Next-Token-Prediction
[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
☆446 Updated 5 months ago
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆147 Updated 2 weeks ago
Jasonlee1995 / ImageNet-1K
ImageNet-1K data download and processing for use as a dataset
☆100 Updated 2 years ago
Fancy-MLLM / R1-Onevision
R1-onevision, a visual language model capable of deep CoT reasoning.
☆541 Updated 3 months ago
zhengli97 / Awesome-Prompt-Adapter-Learning-for-VLMs
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
☆611 Updated last week
yfzhang114 / Awesome-Multimodal-Large-Language-Models
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
☆475 Updated last month
zli12321 / Vision-Language-Models-Overview
A frontier collection and survey of vision-language model papers and model GitHub repositories
☆259 Updated this week
xmindflow / Awesome_Mamba
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
☆238 Updated 5 months ago
xuyang-liu16 / Awesome-Token-level-Model-Compression
📚 Collection of token-level model compression resources.
☆135 Updated last week
Osilly / Vision-R1
This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-sta…
☆637 Updated 2 weeks ago
saccharomycetes / mllms_know
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
☆224 Updated 2 months ago
zhengli97 / PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
☆316 Updated last week
nnnth / UFO
Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface"
☆205 Updated last month
deepcs233 / Visual-CoT
[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …
☆336 Updated 6 months ago
yinizhilian / ICLR2025-Papers-with-Code
A collection of ICLR papers and open-source projects from past years, including ICLR2021, ICLR2022, ICLR2023, ICLR2024, and ICLR2025.
☆361 Updated 3 months ago