microsoft / DELT
DELT: Data Efficacy for Language Model Training
☆34Updated 3 weeks ago
Alternatives and similar repositories for DELT
Users who are interested in DELT are comparing it to the repositories listed below
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆108Updated 3 months ago
- Geometric-Mean Policy Optimization☆80Updated last month
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Updated last year
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆110Updated 2 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆141Updated 5 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆79Updated last month
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆57Updated 9 months ago
- Official implementation of "Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology"☆64Updated 2 months ago
- Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆127Updated 2 months ago
- ☆34Updated last month
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆48Updated 2 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆43Updated 2 months ago
- MiroTrain is an efficient and algorithm-first framework for post-training large agentic models.☆87Updated 3 weeks ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated last year
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆62Updated this week
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆146Updated 3 months ago
- This repository contains the code for our ICML 2025 paper, LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆24Updated 3 months ago
- Official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 8 months ago
- ☆70Updated 3 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆312Updated 3 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆48Updated last week
- ☆55Updated last month
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆67Updated this week
- ☆56Updated 2 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆82Updated 4 months ago
- Open-Pandora: On-the-fly Control Video Generation☆34Updated 9 months ago
- [ICCV 2025] Dynamic-VLM☆25Updated 9 months ago
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul…☆29Updated 2 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆53Updated last week