arcprize / ARC-AGI-3-Agents
☆61Updated this week
Alternatives and similar repositories for ARC-AGI-3-Agents
Users who are interested in ARC-AGI-3-Agents are comparing it to the libraries listed below.
- Testing baseline LLMs performance across various models☆297Updated last week
- ☆88Updated 2 months ago
- ☆130Updated 4 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 7 months ago
- General multi-task deep RL Agent☆184Updated last year
- ☆215Updated last month
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 10 months ago
- ☆56Updated last month
- Decentralized RL Training at Scale☆416Updated this week
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆24Updated last month
- Train your own SOTA deductive reasoning model☆104Updated 5 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆325Updated last month
- ☆118Updated 7 months ago
- Plotting (entropy, varentropy) for small LMs☆98Updated 2 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆238Updated last week
- look how they massacred my boy☆63Updated 9 months ago
- rl from zero pretrain, can it be done? yes.☆193Updated this week
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆324Updated 8 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆72Updated 4 months ago
- Code for ExploreTom☆84Updated last month
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆117Updated last month
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆95Updated 2 weeks ago
- Exploring Applications of GRPO☆245Updated last month
- ☆415Updated 2 months ago
- Open source interpretability artefacts for R1.☆157Updated 3 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆104Updated 5 months ago
- accompanying material for sleep-time compute paper☆102Updated 3 months ago
- ☆66Updated 2 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆45Updated 3 months ago
- An automated tool for discovering insights from research paper corpora☆138Updated last year