allenai / dolma3Links
☆43Updated last week
Alternatives and similar repositories for dolma3
Users that are interested in dolma3 are comparing it to the libraries listed below
Sorting:
- Verifiers for LLM Reinforcement Learning☆80Updated 9 months ago
- ☆67Updated 10 months ago
- Tooling for exact and MinHash deduplication of large-scale text datasets☆60Updated 2 weeks ago
- ☆45Updated 5 months ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆135Updated 4 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆81Updated 2 years ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Updated 2 years ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆114Updated 3 months ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆63Updated last month
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆90Updated 2 years ago
- This is the official repository for Inheritune.☆120Updated 11 months ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Updated last year
- ☆34Updated last year
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"☆56Updated last week
- RL Scaling and Test-Time Scaling (ICML'25)☆112Updated last year
- ☆96Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 9 months ago
- ☆100Updated 5 months ago
- ☆31Updated last year
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆107Updated 10 months ago
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆62Updated 5 months ago
- Data mapping framework for rust stuff☆44Updated this week
- o1 Chain of Thought Examples☆33Updated last year
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆190Updated 6 months ago
- ☆92Updated 8 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆196Updated last month
- ☆46Updated 7 months ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆109Updated 7 months ago
- Reformatted Alignment☆111Updated last year