double-ai / formulaone-dataset-releaseLinks
FormulaOne: A dataset of algorithmic problems based on MSO formulas.
☆23Updated 5 months ago
Alternatives and similar repositories for formulaone-dataset-release
Users that are interested in formulaone-dataset-release are comparing it to the libraries listed below
Sorting:
- ☆29Updated 2 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆38Updated last month
- ☆159Updated last month
- ☆19Updated 10 months ago
- Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding☆193Updated 2 weeks ago
- ☆31Updated 10 months ago
- ☆137Updated 8 months ago
- ☆20Updated 6 months ago
- ☆15Updated last month
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Updated last year
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 3 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆45Updated 6 months ago
- ☆107Updated 2 months ago
- The official GitHub Page for MiniMax☆61Updated 2 months ago
- ☆19Updated 8 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆36Updated 3 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Updated 4 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆168Updated 5 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆50Updated last week
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆49Updated this week
- Implementation of the MetaController proposed in "Emergent temporal abstractions in autoregressive models enable hierarchical reinforceme…☆87Updated this week
- Sparse Inferencing for transformer based LLMs☆218Updated 5 months ago
- Marketplace ML experiment - training without backprop☆27Updated 4 months ago
- Lego for GRPO☆30Updated 8 months ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆85Updated 4 months ago
- qwen3 experiments☆34Updated 6 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 6 months ago
- Official Repository of Native Parallel Reasoner☆100Updated last week
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆38Updated 2 months ago