RainBowLuoCS / DEEMView external linksLinks
(ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.
☆48Jul 1, 2025Updated 7 months ago
Alternatives and similar repositories for DEEM
Users that are interested in DEEM are comparing it to the libraries listed below
Sorting:
- [COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search☆23Aug 26, 2024Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆48Feb 2, 2026Updated last week
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…☆124Nov 8, 2025Updated 3 months ago
- Repository of IPBench☆19Jan 4, 2026Updated last month
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- ☆15Apr 13, 2023Updated 2 years ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆91Feb 14, 2025Updated last year
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆36Mar 9, 2025Updated 11 months ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆65Nov 19, 2024Updated last year
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆41Sep 30, 2024Updated last year
- ☆18May 2, 2024Updated last year
- ☆17Feb 18, 2025Updated 11 months ago
- Towards Defending against Adversarial Examples via Attack-Invariant Features☆12Oct 12, 2023Updated 2 years ago
- ☆19Sep 19, 2024Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆21Dec 16, 2024Updated last year
- NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Models☆14Jan 28, 2023Updated 3 years ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆201Nov 30, 2025Updated 2 months ago
- ICLR‘2021: Robust Early-learning: Hindering the Memorization of Noisy Labels☆78Jun 15, 2021Updated 4 years ago
- [TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”☆46Jan 27, 2024Updated 2 years ago
- ☆21Jan 28, 2023Updated 3 years ago
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆24Nov 17, 2024Updated last year
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…☆48Sep 8, 2025Updated 5 months ago
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors☆31Jun 2, 2024Updated last year
- ☆38Nov 13, 2025Updated 3 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆39Dec 31, 2024Updated last year
- ☆26Feb 8, 2022Updated 4 years ago
- [NeurIPS 2025] VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning☆64Jan 6, 2026Updated last month
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning☆38Apr 20, 2025Updated 9 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆147Dec 22, 2025Updated last month
- 这是我的博客《不用框架,使用Python搭建基于numpy的卷积神经网络来进行cifar-10分类的深度学习系统》的代码实现。☆10Jul 1, 2019Updated 6 years ago
- This module includes functions that can be used to simulate mechanochemical phenomena.☆11Nov 16, 2021Updated 4 years ago
- All things manipulating, quantifying, and visualizing geochemical data☆12Jan 19, 2024Updated 2 years ago
- ☆39Aug 27, 2022Updated 3 years ago
- [Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey☆477Jan 17, 2025Updated last year
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Apr 13, 2022Updated 3 years ago
- Detect interesting SARS-CoV-2 spike protein variants from Sanger sequencing data.☆11Apr 15, 2022Updated 3 years ago
- Graph neural network for predicting energy of known and hypothetical crystal structures☆10Jan 26, 2022Updated 4 years ago
- ☆15Jan 27, 2026Updated 2 weeks ago