collinskatie / structured_flexible_and_robust
☆13 Updated last year
Alternatives and similar repositories for structured_flexible_and_robust:
Users who are interested in structured_flexible_and_robust are comparing it to the libraries listed below.
- ☆16 Updated last year
- Super fast implementations of common benchmark text world games ☆44 Updated last month
- Byte-sized text games for code generation tasks on virtual environments ☆19 Updated 6 months ago
- Repository for Skill Set Optimization ☆12 Updated 5 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval ☆28 Updated 2 months ago
- ☆23 Updated 4 months ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆27 Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod… ☆14 Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment ☆38 Updated last year
- Documentation for dynamic machine learning systems. ☆29 Updated 4 months ago
- Few-shot Learning with Auxiliary Data ☆26 Updated last year
- Code and files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning?" ☆33 Updated last week
- Exploring Few-Shot Adaptation of Language Models with Tables ☆23 Updated 2 years ago
- Minimum Description Length probing for neural network representations ☆18 Updated this week
- ☆30 Updated last year
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks ☆40 Updated last month
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22 ☆65 Updated 2 years ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions" ☆52 Updated 7 months ago
- ☆26 Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆45 Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆57 Updated last year
- ☆35 Updated 2 years ago
- ☆20 Updated 3 years ago
- ☆32 Updated 3 years ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆51 Updated 9 months ago
- ☆13 Updated 3 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated 4 months ago
- ☆17 Updated last year
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL ☆48 Updated 3 months ago