double-ai / formulaone-dataset-releaseLinks
FormulaOne: A dataset of algorithmic problems based on MSO formulas.
☆22Updated 4 months ago
Alternatives and similar repositories for formulaone-dataset-release
Users that are interested in formulaone-dataset-release are comparing it to the libraries listed below
Sorting:
- ☆29Updated last month
- ☆19Updated 9 months ago
- ⚔️ Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆46Updated 6 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆37Updated 3 weeks ago
- ☆145Updated last week
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 2 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆46Updated 5 months ago
- Marketplace ML experiment - training without backprop☆27Updated 3 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated 2 weeks ago
- ☆107Updated last month
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Updated last year
- ☆15Updated last week
- ☆22Updated last year
- The official GitHub Page for MiniMax☆60Updated last month
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆35Updated 2 months ago
- The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆304Updated this week
- ☆31Updated 9 months ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆80Updated 3 months ago
- ☆131Updated 7 months ago
- 🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents☆33Updated this week
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆168Updated 4 months ago
- Moondream MCP Server in Python☆43Updated 5 months ago
- ☆21Updated 5 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 7 months ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 5 months ago
- ☆19Updated 7 months ago
- ☆20Updated 9 months ago
- ☆185Updated last month
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆54Updated 10 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆37Updated last month