keyonvafa / world-model-evaluation
☆46 · Updated 2 months ago
Related projects:
- Sparse and discrete interpretability tool for neural networks · ☆51 · Updated 7 months ago
- gzip Predicts Data-dependent Scaling Laws · ☆31 · Updated 3 months ago
- Evaluation of neuro-symbolic engines · ☆29 · Updated last month
- ☆23 · Updated 2 weeks ago
- ☆39 · Updated 2 months ago
- Code for minimum-entropy coupling. · ☆29 · Updated 2 months ago
- ☆91 · Updated last month
- ☆66 · Updated last month
- Understanding how features learned by neural networks evolve throughout training · ☆30 · Updated this week
- Repository for the paper Stream of Search: Learning to Search in Language · ☆70 · Updated last month
- A programming language for formal/informal computation. · ☆39 · Updated 3 months ago
- ☆8 · Updated last week
- Generative cellular automaton-like learning environments for RL. · ☆19 · Updated last month
- A domain-specific probabilistic programming language for modeling and inference with language models · ☆111 · Updated 11 months ago
- ☆17 · Updated 4 months ago
- NLP with Rust for Python 🦀🐍 · ☆57 · Updated 3 months ago
- Probabilistic programming with HuggingFace language models · ☆83 · Updated 2 weeks ago
- ☆56 · Updated 2 years ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models · ☆37 · Updated 3 months ago
- ☆68 · Updated last month
- Experiments for efforts to train a new and improved t5 · ☆76 · Updated 5 months ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" · ☆58 · Updated 2 years ago
- Probabilistic LLM evaluations. [CogSci2023; ACL2023] · ☆72 · Updated last month
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL · ☆19 · Updated last week
- ☆22 · Updated last year
- ☆54 · Updated last week
- 🧠 Starter templates for doing interpretability research · ☆59 · Updated last year
- Materials for ConceptARC paper · ☆71 · Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap · ☆74 · Updated last month
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models · ☆41 · Updated 3 months ago