andyljones / boardlawView external linksLinks
Scaling scaling laws with board games.
☆53Jul 17, 2023Updated 2 years ago
Alternatives and similar repositories for boardlaw
Users that are interested in boardlaw are comparing it to the libraries listed below
Sorting:
- Fast dataset format and loader☆23Jan 2, 2026Updated last month
- Experiments from "The Description Length of Deep Learning Models"☆10Aug 1, 2018Updated 7 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- ☆30Dec 2, 2024Updated last year
- ☆12Jul 8, 2023Updated 2 years ago
- Simple pytorch net evaluator with Bad Gyal 8 and Mean Girl 8 net included.☆10Nov 23, 2020Updated 5 years ago
- Fast and reliable distributed systems in Python☆33Jan 12, 2026Updated last month
- ☆13Feb 25, 2025Updated 11 months ago
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆14Feb 3, 2023Updated 3 years ago
- Levin tree search guided by both a policy and a heuristic function☆19Jul 13, 2023Updated 2 years ago
- Neural network visualization toolkit for keras☆16Sep 17, 2018Updated 7 years ago
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Scaling Data-Constrained Language Models☆340Jun 28, 2025Updated 7 months ago
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆20Jul 16, 2024Updated last year
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- different AI algorithms to solve board games☆19Nov 4, 2018Updated 7 years ago
- A collection of MuJoCo based environments.☆20Nov 30, 2020Updated 5 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆23Apr 26, 2025Updated 9 months ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated last year
- Playground for reinforcement learning algorithms implemented in TensorFlow☆16Oct 18, 2016Updated 9 years ago
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 8 months ago
- Multi-Agent Reinforcement Learning with Stable-Baselines3☆20Dec 3, 2021Updated 4 years ago
- ☆18Jul 10, 2022Updated 3 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Oct 22, 2019Updated 6 years ago
- realtime latent world model inference demo☆49Nov 11, 2024Updated last year
- ☆26Sep 22, 2025Updated 4 months ago
- [NeurIPS 2025] BOOM, A Planning-driven Model-Based RL algorithm☆58Feb 4, 2026Updated last week
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- train with kittens!☆63Oct 25, 2024Updated last year
- The repository contains code for Adaptive Data Optimization☆32Dec 9, 2024Updated last year
- Can Language Models Solve Olympiad Programming?☆123Jan 14, 2025Updated last year
- Sandbox environment for generalizable agent research☆27Aug 19, 2022Updated 3 years ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆34Oct 28, 2025Updated 3 months ago
- ☆292Jul 15, 2024Updated last year
- Alphazero on GPU thanks to CUDA.jl☆33Aug 30, 2021Updated 4 years ago
- Proximal Policy Option-Critic☆26Jan 4, 2019Updated 7 years ago
- ☆35Feb 26, 2024Updated last year