JackHopkins / factorio-learning-environmentLinks
A non-saturating, open-ended environment for evaluating LLMs in Factorio
☆736Updated this week
Alternatives and similar repositories for factorio-learning-environment
Users that are interested in factorio-learning-environment are comparing it to the libraries listed below
Sorting:
- ☆153Updated 3 weeks ago
- ☆1,048Updated 7 months ago
- Grandmaster-Level Chess Without Search☆580Updated 5 months ago
- A python library to artfully visualize Factorio Blueprints and an interactive web demo for using it.☆571Updated 5 months ago
- ☆163Updated 3 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆311Updated 8 months ago
- ☆210Updated 3 months ago
- ☆131Updated 2 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆277Updated 2 weeks ago
- i will automate factorio☆106Updated 10 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆616Updated 3 months ago
- Agent Reinforcement Trainer for training multi-turn agents using GRPO☆751Updated this week
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆368Updated last year
- Browser-LLM Auto-Scaling Technology☆524Updated this week
- Animating R1's thoughts.☆382Updated 4 months ago
- Live-bending a foundation model’s output at neural network level.☆259Updated 2 months ago
- Applying the ideas of Deepseek R1 to computer use☆214Updated 4 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆737Updated 2 weeks ago
- Enhancing the Factorio experience with SAT solvers☆740Updated 10 months ago
- A browser-based, WebGL2 implementation of GPT-2 with transform block and attention matrix visualization☆326Updated last week
- ☆145Updated 2 months ago
- explore token trajectory trees on instruct and base models☆127Updated 3 weeks ago
- This repo tracks the opened and merged PRs by the top SWE coding agents by OpenAI, GitHub, and others. Updates every 3 hours.☆146Updated this week
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 2 months ago
- This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software E…☆1,408Updated last month
- HTTP API for Claude Code, Goose, Aider, and Codex☆605Updated this week
- Visualize the intermediate output of Mistral 7B☆367Updated 5 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆514Updated this week
- Diffusion on syntax trees for program synthesis☆459Updated 11 months ago
- Fully neural approach for text chunking☆357Updated last month