RewardReports / reward-reportsLinks
Documentation for dynamic machine learning systems.
☆29Updated 10 months ago
Alternatives and similar repositories for reward-reports
Users that are interested in reward-reports are comparing it to the libraries listed below
Sorting:
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated 2 years ago
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated 2 years ago
- Finding semantically meaningful and accurate prompts.☆47Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated 2 years ago
- Ludwig benchmark☆20Updated 3 years ago
- ☆29Updated last year
- Google Research☆46Updated 2 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- ☆34Updated 2 years ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"☆30Updated 2 years ago
- A Toolkit for Distributional Control of Generative Models☆73Updated last year
- ☆29Updated 2 years ago
- Super fast implementations of common benchmark text world games☆49Updated 4 months ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 3 years ago
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- ☆13Updated 2 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Updated 4 years ago
- ☆56Updated 8 months ago
- A simple way to manage and store the data related to all your research papers!☆18Updated 2 years ago
- ☆41Updated 10 months ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- Residual Quantization Autoencoder, used for interpreting LLMs☆12Updated 6 months ago
- Code for 'Emergent Analogical Reasoning in Large Language Models'☆51Updated last year
- ☆23Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated last year
- Understanding how features learned by neural networks evolve throughout training☆36Updated 8 months ago
- Entailment self-training☆25Updated 2 years ago
- Implementations of Curious Replay for model-based adaptation.☆41Updated 2 years ago