RewardReports / reward-reports
Documentation for dynamic machine learning systems.
☆27Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for reward-reports
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated last year
- Minimum Description Length probing for neural network representations☆16Updated last week
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- Super fast implementations of common benchmark text world games☆43Updated 2 weeks ago
- ☆14Updated last month
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆36Updated 2 months ago
- Ludwig benchmark☆19Updated 2 years ago
- Repository for Skill Set Optimization☆12Updated 3 months ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- ☆34Updated last year
- ☆25Updated 3 weeks ago
- ☆53Updated 2 weeks ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆31Updated 5 months ago
- A framework for implementing equivariant DL☆10Updated 3 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 2 months ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated last year
- INTeractive learning via REPresentatIon Discovery☆34Updated 5 months ago
- ☆29Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆36Updated 2 weeks ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆18Updated 4 months ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- A sample pattern for running CI tests on Modal☆13Updated 2 months ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Updated 3 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆26Updated last year
- ☆18Updated 7 months ago
- TaskMet Task-driven Metric Learning for Model Learning☆18Updated 9 months ago
- Submission to the inverse scaling prize☆23Updated last year