JackHopkins / factorio-learning-environment
A non-saturating, open-ended environment for evaluating LLMs in Factorio
☆783Updated this week
Alternatives and similar repositories for factorio-learning-environment
Users who are interested in factorio-learning-environment compare it to the libraries listed below.
- ☆159Updated 2 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆287Updated 3 weeks ago
- ☆1,126Updated 10 months ago
- A python library to artfully visualize Factorio Blueprints and an interactive web demo for using it.☆579Updated 7 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆622Updated 5 months ago
- ☆163Updated 5 months ago
- Live-bending a foundation model’s output at neural network level.☆265Updated 4 months ago
- High-Performance Implementation of OpenAI's TikToken.☆449Updated 2 months ago
- ☆225Updated 6 months ago
- Grandmaster-Level Chess Without Search☆588Updated 7 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆608Updated 6 months ago
- Animating R1's thoughts.☆384Updated 6 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆854Updated 2 months ago
- Browser-LLM Auto-Scaling Technology☆546Updated last week
- ☆150Updated last month
- An LLM trained only on data from certain time periods to reduce modern bias☆521Updated 2 weeks ago
- Easily train AlphaZero-like agents on any environment you want!☆431Updated last year
- ☆151Updated last month
- ☆156Updated 5 months ago
- Applying the ideas of Deepseek R1 to computer use☆216Updated 7 months ago
- Felafax is building AI infra for non-NVIDIA GPUs☆566Updated 7 months ago
- R.L. methods and techniques.☆199Updated 9 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆240Updated 2 weeks ago
- LLM Analytics☆677Updated 10 months ago
- See Through Your Models☆400Updated last month
- VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programs☆353Updated 6 months ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆373Updated last year
- Import wisdom, export code.☆377Updated 3 months ago
- A hub for various industry-specific schemas to be used with VLMs.☆532Updated 3 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆321Updated 10 months ago