carlini / chess-llm
Play chess against large language models.
☆47 Updated last year
Alternatives and similar repositories for chess-llm
Users that are interested in chess-llm are comparing it to the libraries listed below
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated last year
- Code for reproducing our paper "Not All Language Model Features Are Linear" ☆75 Updated 7 months ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling ☆123 Updated last year
- Sparse and discrete interpretability tool for neural networks ☆63 Updated last year
- ☆28 Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆190 Updated last year
- Measuring the situational awareness of language models ☆35 Updated last year
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆18 Updated 5 months ago
- Adversarial Attacks on GPT-4 via Simple Random Search [Dec 2023] ☆43 Updated last year
- ☆55 Updated 9 months ago
- Sparse Autoencoder Training Library ☆52 Updated last month
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22 ☆66 Updated 2 years ago
- Repo to reproduce the First-Explore paper results ☆37 Updated 6 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆50 Updated 5 months ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec… ☆21 Updated last year
- ☆134 Updated 7 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆63 Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 9 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language ☆149 Updated 4 months ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆78 Updated 2 years ago
- ☆66 Updated last month
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023 ☆136 Updated last year
- ☆96 Updated last year
- A repository for training nanogpt-based Chess playing language models. ☆24 Updated last year
- RuLES: a benchmark for evaluating rule-following in language models ☆226 Updated 4 months ago
- ☆53 Updated last year
- Memoria is a human-inspired memory architecture for neural networks. ☆74 Updated 8 months ago
- Language models scale reliably with over-training and on downstream tasks ☆97 Updated last year
- Dataset Reset Policy Optimization ☆30 Updated last year