keyonvafa / world-model-evaluation
☆49 Updated last week
Related projects
Alternatives and complementary repositories for world-model-evaluation
- Probabilistic programming with HuggingFace language models ☆89 Updated this week
- Sparse and discrete interpretability tool for neural networks ☆55 Updated 9 months ago
- ☆75 Updated last month
- A programming language for formal/informal computation. ☆41 Updated 5 months ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL ☆43 Updated last month
- Evaluation of neuro-symbolic engines ☆33 Updated 3 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ☆21 Updated 3 weeks ago
- Code for minimum-entropy coupling. ☆30 Updated 4 months ago
- Experiments for efforts to train a new and improved t5 ☆76 Updated 7 months ago
- gzip Predicts Data-dependent Scaling Laws ☆32 Updated 5 months ago
- Understanding how features learned by neural networks evolve throughout training ☆31 Updated last month
- ☆58 Updated 2 years ago
- A domain-specific probabilistic programming language for modeling and inference with language models ☆112 Updated last year
- ☆24 Updated 7 months ago
- ☆101 Updated 3 months ago
- ☆44 Updated last month
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22 ☆64 Updated 2 years ago
- 🧠 Starter templates for doing interpretability research ☆63 Updated last year
- Textbook on reinforcement learning from human feedback ☆76 Updated 3 weeks ago
- ☆68 Updated 3 months ago
- Training code for Sparse Autoencoders on Embedding models ☆33 Updated 3 weeks ago
- Code for reproducing our paper "Not All Language Model Features Are Linear" ☆61 Updated last week
- A mechanistic approach for understanding and detecting factual errors of large language models. ☆39 Updated 4 months ago
- Functional Benchmarks and the Reasoning Gap ☆78 Updated last month
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆51 Updated 3 weeks ago
- ☆26 Updated last year
- ☆18 Updated 10 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆93 Updated 3 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆20 Updated 3 months ago