carlini / chess-llmLinks
Play chess against large language models.
☆47Updated last year
Alternatives and similar repositories for chess-llm
Users that are interested in chess-llm are comparing it to the libraries listed below
Sorting:
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- Measuring the situational awareness of language models☆37Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness☆50Updated 5 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆129Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆60Updated 4 months ago
- ☆22Updated 9 months ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆124Updated last year
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated 10 months ago
- Repo to reproduce the First-Explore paper results☆37Updated 6 months ago
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆69Updated last week
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 10 months ago
- Lottery Ticket Adaptation☆39Updated 7 months ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆29Updated last year
- ☆88Updated last month
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI☆30Updated 8 months ago
- ☆99Updated last year
- BASALT Benchmark datasets, evaluation code and agent training example.☆20Updated last year
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆93Updated 2 years ago
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆63Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆190Updated last year
- Multi-Domain Expert Learning☆67Updated last year
- General multi-task deep RL Agent☆183Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆60Updated 6 months ago
- ☆41Updated last week
- Experiments for efforts to train a new and improved t5☆76Updated last year
- ☆137Updated 8 months ago
- σ-GPT: A New Approach to Autoregressive Models☆65Updated 11 months ago
- Learn online intrinsic rewards from LLM feedback☆41Updated 7 months ago