beccabai / multi-agent-data-selection

This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.

☆40

Alternatives and similar repositories for multi-agent-data-selection

Users that are interested in multi-agent-data-selection are comparing it to the libraries listed below

Sorting:

HZQ950419 / Math-LLaVA
Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
☆84Updated 10 months ago
RM-R1-UIUC / RM-R1
☆63Updated last week
SihengLi99 / SEALONG
Large Language Models Can Self-Improve in Long-context Reasoning
☆69Updated 5 months ago
mathllm / MATH-V
[NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.
☆106Updated this week
NuoJohnChen / JudgeLRM
☆26Updated last month
IDEA-FinAI / RagVL
Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …
☆75Updated 6 months ago
EvolvingLMMs-Lab / multimodal-search-r1
☆97Updated last month
VisualWebBench / VisualWebBench
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
☆56Updated 6 months ago
Liuziyu77 / MIA-DPO
Official implement of MIA-DPO
☆57Updated 3 months ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"
☆90Updated last week
John-AI-Lab / NoisyRollout
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆54Updated last week
tianyi-lab / MoE-Embedding
Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆68Updated 7 months ago
shiqichen17 / VLM_Merging
Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)
☆22Updated last week
AIDC-AI / Parrot
🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch.
☆40Updated 2 weeks ago
RifleZhang / LLaVA-Reasoner-DPO
☆75Updated 4 months ago
princeton-nlp / CharXiv
[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
☆113Updated 3 weeks ago
OpenGVLab / MMIU
[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
☆71Updated 8 months ago
thunlp / DeepPerception
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
☆54Updated last month
ECNU-ICALK / EduChat-Math
☆30Updated 6 months ago
MME-Benchmarks / MME-RealWorld
✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
☆118Updated 2 months ago
IDEA-FinAI / ChartMoE
[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding
☆78Updated last month
RUCAIBox / Virgo
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆100Updated 2 months ago
opendatalab / LOKI
[ICLR 2025 Spotlight] The official implementation of the paper “LOKI：A Comprehensive Synthetic Data Detection Benchmark using Large Multi…
☆149Updated last month
beichenzbc / BoostStep
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆35Updated 3 months ago
yihedeng9 / OpenVLThinker
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆84Updated last week
njucckevin / MM-Self-Improve
A Self-Training Framework for Vision-Language Reasoning
☆78Updated 3 months ago
YangLing0818 / SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆69Updated last month
Wang-ML-Lab / multimodal-needle-in-a-haystack
[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models
☆43Updated 2 weeks ago
LightChen233 / M3CoT
☆73Updated 11 months ago
mayubo2333 / MMLongBench-Doc
Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations
☆80Updated 10 months ago