RewardReports / reward-reports
Documentation for dynamic machine learning systems.
☆26Updated last week
Related projects:
- A virtual environment for developing and evaluating automated scientific discovery agents.☆23Updated 3 months ago
- Ludwig benchmark☆19Updated 2 years ago
- A neurosymbolic T5 agent for playing text games, from the EACL 2023 paper "Behavior Cloned Transformers are Neurosymbolic Reasoners"☆19Updated last year
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- The intermediate goal of the project is to train a GPT-like architecture to summarise Reddit posts from human preferences, as th…☆12Updated 3 years ago
- Minimum Description Length probing for neural network representations☆15Updated 11 months ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆33Updated last week
- Evaluation of neuro-symbolic engines☆29Updated last month
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated last year
- Super fast implementations of common benchmark text world games☆43Updated last month
- Repo to reproduce the First-Explore paper results☆36Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆30Updated 3 months ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- TaskMet: Task-driven Metric Learning for Model Learning☆18Updated 7 months ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆10Updated 2 years ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆16Updated 2 months ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in PyTorch☆23Updated 3 years ago
- Clean RL implementation using MLX☆26Updated 6 months ago
- Understanding RL vision Distill article☆23Updated last year
- A framework for implementing equivariant DL☆10Updated 3 years ago