The State Of The Art, intelligence
☆157Aug 12, 2025Updated 6 months ago
Alternatives and similar repositories for Crux1
Users that are interested in Crux1 are comparing it to the libraries listed below
Sorting:
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Sep 19, 2023Updated 2 years ago
- ☆16Oct 2, 2022Updated 3 years ago
- ☆12May 20, 2025Updated 9 months ago
- ☆15Apr 26, 2025Updated 10 months ago
- ☆11May 18, 2025Updated 9 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆66May 5, 2025Updated 9 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 4 months ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆17Jun 29, 2025Updated 8 months ago
- ☆17Apr 20, 2025Updated 10 months ago
- Pokedex for LLMs☆14Apr 14, 2025Updated 10 months ago
- Exploring Applications of GRPO☆251Aug 25, 2025Updated 6 months ago
- [ICLR2026] Test-Time Scaling with Reflective Generative Model☆301Jan 28, 2026Updated last month
- General benchmarking apparatus for running multi-agent systems against benchmarks☆42Jan 29, 2026Updated last month
- qwen3 experiments☆34Jul 1, 2025Updated 8 months ago
- A benchmark for conversational bargaining by language models. In each 20‑round match one LLM plays buyer, one plays seller, and both hold…☆34Aug 21, 2025Updated 6 months ago
- Portfolio REgret for Confidence SEquences☆20Jan 6, 2026Updated last month
- ☆19Mar 3, 2025Updated last year
- ☆21Oct 8, 2024Updated last year
- ☆111Jan 27, 2026Updated last month
- One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours.☆24Nov 9, 2025Updated 3 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆308Dec 6, 2025Updated 2 months ago
- Simple AI-based (Claude-3) game emulator☆49May 14, 2024Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Feb 11, 2026Updated 2 weeks ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Jul 19, 2025Updated 7 months ago
- ☆25Aug 8, 2023Updated 2 years ago
- Lego for GRPO☆30May 27, 2025Updated 9 months ago
- ☆114Jul 1, 2025Updated 8 months ago
- A recursive coding agent inpired by RLMs☆142Updated this week
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆71Jan 13, 2026Updated last month
- ☆76Feb 18, 2026Updated last week
- Port of Facebook's LLaMA model in C/C++☆32Mar 7, 2024Updated last year
- All information and news with respect to Falcon-H1 series☆108Oct 9, 2025Updated 4 months ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle☆303Dec 16, 2025Updated 2 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 2 months ago
- Build your own visual reasoning model☆419Jan 13, 2026Updated last month
- ☆13Dec 30, 2024Updated last year
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Jul 3, 2025Updated 8 months ago
- ☆71Oct 23, 2025Updated 4 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆238Feb 24, 2025Updated last year