(ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.
☆49 Jul 1, 2025 Updated 8 months ago
Alternatives and similar repositories for DEEM
Users who are interested in DEEM are comparing it to the libraries listed below.
- [COLING 2024 (Oral)] PromISe: Releasing the Capabilities of LLMs with Prompt Introspective Search ☆23 Aug 26, 2024 Updated last year
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆60 Jul 23, 2024 Updated last year
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents ☆48 Feb 2, 2026 Updated last month
- Repository of IPBench ☆19 Jan 4, 2026 Updated 2 months ago
- ☆28 Oct 28, 2024 Updated last year
- Regularly Truncated M-estimators for Learning with Noisy Labels ☆11 Apr 24, 2024 Updated last year
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification ☆11 Nov 15, 2023 Updated 2 years ago
- ☆15 Apr 13, 2023 Updated 2 years ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆92 Feb 14, 2025 Updated last year
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs ☆37 Mar 9, 2025 Updated 11 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models ☆41 Sep 30, 2024 Updated last year
- ☆18 May 2, 2024 Updated last year
- ☆18 Feb 18, 2025 Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling ☆22 Dec 16, 2024 Updated last year
- Towards Defending against Adversarial Examples via Attack-Invariant Features ☆12 Oct 12, 2023 Updated 2 years ago
- ☆19 Sep 19, 2024 Updated last year
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆202 Nov 30, 2025 Updated 3 months ago
- TPAMI: Classification with noisy labels by importance reweighting. ☆39 Oct 4, 2019 Updated 6 years ago
- ICLR'2021: Robust Early-learning: Hindering the Memorization of Noisy Labels ☆78 Jun 15, 2021 Updated 4 years ago
- [TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning” ☆46 Jan 27, 2024 Updated 2 years ago
- ☆31 Sep 12, 2025 Updated 5 months ago
- ☆21 Jan 28, 2023 Updated 3 years ago
- [EMNLP'24 (Main)] DRPO (Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-… ☆24 Nov 17, 2024 Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?" ☆31 Dec 23, 2024 Updated last year
- Removing Adversarial Noise in Class Activation Feature Space ☆14 Oct 12, 2023 Updated 2 years ago
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag… ☆49 Feb 16, 2026 Updated 2 weeks ago
- CVPR 2023: Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection ☆26 Mar 27, 2023 Updated 2 years ago
- [MM 2025] Towards Modality Generalization: A Benchmark and Prospective Analysis ☆28 May 22, 2025 Updated 9 months ago
- ☆38 Nov 13, 2025 Updated 3 months ago
- Graph Neural Network architecture to solve the decision version of the graph coloring problem (GCP) ☆25 Jan 27, 2020 Updated 6 years ago
- Official completion of “Training on the Benchmark Is Not All You Need”. ☆39 Dec 31, 2024 Updated last year
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning ☆39 Apr 20, 2025 Updated 10 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA ☆147 Dec 22, 2025 Updated 2 months ago
- ☆31 Mar 24, 2023 Updated 2 years ago
- [Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey ☆476 Jan 17, 2025 Updated last year
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L… ☆37 Apr 13, 2022 Updated 3 years ago
- pytorch ☆10 Apr 13, 2022 Updated 3 years ago
- A simple OpenGL 3.2 example using MSVS 2010 and freeglut ☆12 Feb 4, 2013 Updated 13 years ago
- Scotch pipeline for indel calling. ☆10 Nov 25, 2019 Updated 6 years ago