JackHopkins / factorio-learning-environmentLinks
A non-saturating, open-ended environment for evaluating LLMs in Factorio
☆870Updated last week
Alternatives and similar repositories for factorio-learning-environment
Users that are interested in factorio-learning-environment are comparing it to the libraries listed below
Sorting:
- ☆177Updated 3 weeks ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆295Updated 4 months ago
- Grandmaster-Level Chess Without Search☆597Updated 11 months ago
- ☆1,157Updated last year
- A python library to artfully visualize Factorio Blueprints and an interactive web demo for using it.☆578Updated 11 months ago
- ☆164Updated 9 months ago
- Live-bending a foundation model’s output at neural network level.☆272Updated 8 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆628Updated 9 months ago
- High-Performance Implementation of OpenAI's TikToken.☆465Updated 5 months ago
- Worker to orchestrate and manage running an arbitrary number of LLM-generated builds concurrently using containerized Minecraft Servers.☆166Updated last year
- Easily train AlphaZero-like agents on any environment you want!☆433Updated last year
- ☆179Updated 8 months ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆375Updated last year
- ☆165Updated 5 months ago
- ☆248Updated 9 months ago
- ☆308Updated last week
- i will automate factorio☆111Updated last year
- A slight upgrade to the Gremlins in your code☆643Updated 5 months ago
- Frontier Models playing the board game Diplomacy.☆610Updated last month
- ☆150Updated 5 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆219Updated last year
- This repo tracks the opened and merged PRs by the top SWE coding agents by OpenAI, GitHub, and others. Updates every 3 hours.☆296Updated this week
- A LLM trained only on data from certain time periods to reduce modern bias☆814Updated this week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆928Updated 6 months ago
- Animating R1's thoughts.☆384Updated 10 months ago
- Parallel Reasoning: llm-consortium orchestrates mulitple LLMs, iteratively refines & achieves consensus.☆370Updated last month
- A cache for AI agents to learn and replay complex behaviors.☆758Updated 6 months ago
- Async RL Training at Scale☆950Updated this week
- ☆461Updated last month
- Import wisdom, export code.☆380Updated 3 months ago