OPPO-Mente-Lab / DaMoLinks
The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》
☆28Updated last week
Alternatives and similar repositories for DaMo
Users that are interested in DaMo are comparing it to the libraries listed below
Sorting:
- ☆14Updated 10 months ago
 - This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions".☆16Updated this week
 - This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆35Updated last year
 - instruction-following benchmark for large reasoning models☆45Updated 2 months ago
 - A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆32Updated 2 months ago
 - RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 10 months ago
 - ☆30Updated 2 months ago
 - From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆23Updated 3 weeks ago
 - [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆57Updated last year
 - The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Updated last year
 - Extending context length of visual language models☆12Updated 10 months ago
 - Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆39Updated 2 months ago
 - Evaluating the faithfulness of long-context language models☆30Updated last year
 - [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆18Updated last year
 - 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Updated last year
 - Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆18Updated 8 months ago
 - [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆19Updated 8 months ago
 - [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Updated 9 months ago
 - ☆21Updated 6 months ago
 - [ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning☆13Updated last year
 - ☆18Updated 11 months ago
 - This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆50Updated last year
 - ☆116Updated 2 weeks ago
 - Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆52Updated 5 months ago
 - ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆50Updated 3 months ago
 - [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆83Updated 8 months ago
 - ☆84Updated last year
 - Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆91Updated last year
 - ☆14Updated 9 months ago
 - ☆19Updated 10 months ago