keyonvafa / world-model-evaluationLinks
☆56Updated 8 months ago
Alternatives and similar repositories for world-model-evaluation
Users that are interested in world-model-evaluation are comparing it to the libraries listed below
Sorting:
- Code for minimum-entropy coupling.☆32Updated last year
- gzip Predicts Data-dependent Scaling Laws☆35Updated last year
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- Understanding how features learned by neural networks evolve throughout training☆36Updated 8 months ago
- Probabilistic programming with large language models☆124Updated last month
- ☆68Updated 11 months ago
- ☆101Updated 5 months ago
- Evaluation of neuro-symbolic engines☆36Updated 11 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 2 months ago
- A programming language for formal/informal computation.☆41Updated 2 weeks ago
- Collection of LLM completions for reasoning-gym task datasets☆26Updated last week
- Generative cellular automaton-like learning environments for RL.☆19Updated 5 months ago
- ☆134Updated 3 months ago
- ☆38Updated 11 months ago
- The Energy Transformer block, in JAX☆57Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language models☆132Updated 2 months ago
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆73Updated 11 months ago
- ☆30Updated last year
- Experiments for efforts to train a new and improved t5☆76Updated last year
- ☆60Updated 3 years ago
- ☆45Updated 9 months ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- Scaling scaling laws with board games.☆49Updated last year
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆49Updated 9 months ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 9 months ago
- ☆28Updated 3 weeks ago
- Language-annotated Abstraction and Reasoning Corpus☆88Updated 2 years ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)☆37Updated 2 years ago
- Materials for ConceptARC paper☆96Updated 8 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆84Updated last year