Relaxed-System-Lab / multi-actor-data-selection

This is the repo for the paper "Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining".

☆43

Alternatives and similar repositories for multi-actor-data-selection

Users who are interested in multi-actor-data-selection are comparing it to the repositories listed below.


SihengLi99 / SEALONG
Large Language Models Can Self-Improve in Long-context Reasoning
☆70 Updated 7 months ago
HZQ950419 / Math-LLaVA
Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
☆84 Updated 11 months ago
RM-R1-UIUC / RM-R1
RM-R1: Unleashing the Reasoning Potential of Reward Models
☆108 Updated 3 weeks ago
pengshuai-rin / MultiMath
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
☆29 Updated 5 months ago
VisualWebBench / VisualWebBench
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
☆57 Updated 8 months ago
THU-KEG / AdaptThink
☆116 Updated last month
NUS-TRAIL / NoisyRollout
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆69 Updated 3 weeks ago
thunlp / ChartCoder
[ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
☆51 Updated this week
beichenzbc / BoostStep
Official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆35 Updated 5 months ago
NuoJohnChen / JudgeLRM
☆29 Updated 2 months ago
RUCAIBox / Virgo
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆104 Updated last month
MME-Benchmarks / MME-RealWorld
✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
☆124 Updated 3 months ago
IDEA-FinAI / ChartMoE
[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding
☆84 Updated 2 months ago
OpenGVLab / MM-NIAH
[NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of…
☆117 Updated 7 months ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"
☆119 Updated 3 weeks ago
yhy-2000 / VideoDeepResearch
☆58 Updated this week
princeton-nlp / CharXiv
[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
☆119 Updated 2 months ago
EvolvingLMMs-Lab / multimodal-search-r1
☆112 Updated this week
ShadeCloak / ADORA
☆46 Updated 2 months ago
Dongping-Chen / MLLM-Judge
[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.
☆70 Updated 4 months ago
njucckevin / MM-Self-Improve
A Self-Training Framework for Vision-Language Reasoning
☆80 Updated 5 months ago
IDEA-FinAI / RagVL
Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …
☆78 Updated 7 months ago
allenai / pixmo-docs
ACL 2025: Synthetic data generation pipelines for text-rich images.
☆82 Updated 3 months ago
JLZhong23 / awesome-reward-models
☆64 Updated 3 weeks ago
OpenGVLab / ChartAst
[ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.
☆119 Updated 9 months ago
shiqichen17 / VLM_Merging
GitHub repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)
☆62 Updated 3 weeks ago
hewei2001 / ReachQA
Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs"
☆54 Updated 7 months ago
haon-chen / mmE5
☆49 Updated 4 months ago
yihedeng9 / OpenVLThinker
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆91 Updated last month
MileBench / MileBench
This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"
☆35 Updated 11 months ago