adewynter / DoomLinks
Repository for the paper "Will GPT-4 Run DOOM?"
☆24Updated last year
Alternatives and similar repositories for Doom
Users that are interested in Doom are comparing it to the libraries listed below
Sorting:
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆115Updated last year
- Generative cellular automaton-like learning environments for RL.☆19Updated 10 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆57Updated 10 months ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94Updated 2 years ago
- ☆58Updated 5 months ago
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆64Updated last year
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆132Updated 2 months ago
- Training AI for Super Smash Bros. Melee☆31Updated 9 months ago
- General multi-task deep RL Agent☆186Updated last year
- ☆98Updated last week
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆34Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆134Updated 2 years ago
- ☆27Updated last year
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆39Updated last year
- Official repository of the NeurIPS 2025 Competition: The PokeAgent Challenge: Competitive and Long-Context Learning at Scale. (Track 2, S…☆67Updated last week
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- Memoria is a human-inspired memory architecture for neural networks.☆80Updated last year
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆60Updated 2 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- ☆15Updated last year
- Exploration into the Firefly algorithm in Pytorch☆41Updated 10 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆122Updated 2 months ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)☆66Updated 2 years ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 10 months ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- Verlog: A Multi-turn RL framework for LLM agents☆67Updated last month
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆63Updated 8 months ago
- Implementation of the Llama architecture with RLHF + Q-learning☆168Updated 10 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- Gymnasium environment for Pokemon Red☆45Updated last year