collinskatie / structured_flexible_and_robust
☆13Updated last year
Related projects: ⓘ
- Few-shot Learning with Auxiliary Data☆26Updated 9 months ago
- Byte-sized text games for code generation tasks on virtual environments☆17Updated 2 months ago
- ☆23Updated 2 weeks ago
- ☆16Updated 10 months ago
- ☆27Updated last year
- Repository for Skill Set Optimization☆12Updated last month
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Updated 3 years ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆11Updated 2 years ago
- Super fast implementations of common benchmark text world games☆43Updated last month
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- Minimum Description Length probing for neural network representations☆15Updated 11 months ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆37Updated last year
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆57Updated 10 months ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆33Updated last week
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆54Updated last year
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆34Updated 6 months ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆26Updated last year
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆28Updated 6 months ago
- Documentation for dynamic machine learning systems.☆26Updated last week
- ☆31Updated 3 years ago
- Tasks for describing differences between text distributions.☆15Updated last month
- Language-annotated Abstraction and Reasoning Corpus☆76Updated last year
- ☆23Updated this week
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆18Updated last year
- Embedding Recycling for Language models☆38Updated last year
- ☆33Updated 2 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆18Updated 10 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆30Updated last month