allenai / dolma3Links
☆35Updated 2 weeks ago
Alternatives and similar repositories for dolma3
Users that are interested in dolma3 are comparing it to the libraries listed below
Sorting:
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"☆46Updated this week
- ☆31Updated last year
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆81Updated last year
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆108Updated 9 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆51Updated 6 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- Reformatted Alignment☆113Updated last year
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆119Updated 7 months ago
- ☆46Updated 6 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆89Updated last year
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆58Updated 9 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated 2 weeks ago
- RL Scaling and Test-Time Scaling (ICML'25)☆112Updated 10 months ago
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆58Updated 4 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- ☆52Updated last year
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆21Updated 3 years ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆104Updated 2 months ago
- [ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆55Updated 6 months ago
- ☆105Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 8 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆60Updated last year
- This repo explores how AMR to address tasks difficult for LLMs☆13Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆59Updated last year
- ☆67Updated 8 months ago
- Aioli: A unified optimization framework for language model data mixing☆31Updated 11 months ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆48Updated 5 months ago
- ☆95Updated last year
- Trending projects & awesome papers about data-centric llm studies.☆38Updated 7 months ago