gszfwsb / NCFM
Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in CVPR 2025.
☆318Updated 3 weeks ago
Alternatives and similar repositories for NCFM:
Users that are interested in NCFM are comparing it to the libraries listed below
- Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey☆314Updated this week
- R1-onevision, a visual language model capable of deep CoT reasoning.☆475Updated this week
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆267Updated this week
- ☆149Updated 7 months ago
- This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-sta…☆404Updated last week
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆239Updated 2 months ago
- A curated list of papers on the applications of RWKV in computer vision.☆163Updated last month
- MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning☆459Updated this week
- Explore the Multimodal “Aha Moment” on 2B Model☆538Updated 2 weeks ago
- 《多模态大模型:新一代人工智能技术范式》作者:刘阳,林倞☆194Updated 3 months ago
- Efficient Multimodal Large Language Models: A Survey☆330Updated 3 weeks ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆432Updated last month
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆659Updated this week
- PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437☆1,019Updated last month
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆522Updated this week
- Official repository for VisionZip (CVPR 2025)☆262Updated last month
- A paper list of some recent works about Token Compress for Vit and VLM☆391Updated this week
- ✨First Open-Source R1-like Video-LLM [2025/02/18]☆301Updated last month
- Pruning the VLLMs☆89Updated 3 months ago
- [Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey☆403Updated 2 months ago
- ☆92Updated this week
- A fork to add multimodal model training to open-r1☆1,139Updated last month
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆283Updated 3 months ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆284Updated 2 weeks ago
- Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models☆347Updated last week
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆484Updated last week
- ☆298Updated last month
- Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’☆1,458Updated last week
- 历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.☆209Updated 2 weeks ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆168Updated 10 months ago