keyonvafa / world-model-evaluation
☆55 Updated 5 months ago
Alternatives and similar repositories for world-model-evaluation:
Users who are interested in world-model-evaluation are comparing it to the repositories listed below:
- A programming language for formal/informal computation. ☆41 Updated 3 weeks ago
- Sparse and discrete interpretability tool for neural networks ☆61 Updated last year
- Probabilistic programming with large language models ☆116 Updated 3 weeks ago
- Implementing RASP transformer programming language https://arxiv.org/pdf/2106.06981.pdf. ☆53 Updated 3 years ago
- Code for minimum-entropy coupling. ☆31 Updated 10 months ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆44 Updated 2 weeks ago
- ☆19 Updated last week
- ☆92 Updated 2 months ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network" ☆21 Updated 11 months ago
- ☆60 Updated 3 years ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL ☆48 Updated 6 months ago
- gzip Predicts Data-dependent Scaling Laws ☆35 Updated 11 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 Updated 3 months ago
- Understanding how features learned by neural networks evolve throughout training ☆34 Updated 6 months ago
- Materials for ConceptARC paper ☆92 Updated 6 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks ☆42 Updated 5 months ago
- A domain-specific probabilistic programming language for modeling and inference with language models ☆129 Updated last week
- Experiments for efforts to train a new and improved t5 ☆77 Updated last year
- Evaluation of neuro-symbolic engines ☆35 Updated 9 months ago
- ☆38 Updated 9 months ago
- Learn online intrinsic rewards from LLM feedback ☆37 Updated 4 months ago
- Collection of LLM completions for reasoning-gym task datasets ☆19 Updated last week
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning ☆33 Updated 4 months ago
- ☆27 Updated last year
- A reading list of relevant papers and projects on foundation model annotation ☆27 Updated 2 months ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22 ☆65 Updated 2 years ago
- Probabilistic LLM evaluations. [CogSci2023; ACL2023] ☆73 Updated 9 months ago
- Mechanistic Interpretability for Transformer Models ☆50 Updated 2 years ago
- Universal Neurons in GPT2 Language Models ☆28 Updated 11 months ago
- ☆11 Updated this week