TheDuckAI / prmLinks
☆12Updated last year
Alternatives and similar repositories for prm
Users that are interested in prm are comparing it to the libraries listed below
Sorting:
- Scaling scaling laws with board games.☆53Updated 2 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆83Updated 3 years ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆228Updated 2 months ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆211Updated 2 years ago
- ☆185Updated 2 years ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆185Updated 8 months ago
- Bootstrapping ARC☆155Updated last year
- ☆110Updated last year
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆201Updated 2 years ago
- Language-annotated Abstraction and Reasoning Corpus☆99Updated 2 years ago
- Materials for ConceptARC paper☆112Updated last year
- ☆84Updated 2 years ago
- Multiple datasets for ARC (Abstraction and Reasoning Corpus)☆87Updated 10 months ago
- Train very large language models in Jax.☆210Updated 2 years ago
- Nethack Learning Environment Wrapper for Language Interface☆41Updated 2 years ago
- ☆144Updated 6 months ago
- Abstract Reasoning with Graph Abstractions (ARGA) implementation☆61Updated last year
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server☆56Updated last year
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Updated 4 months ago
- Learning Formal Mathematics from Intrinsic Motivation☆36Updated 6 months ago
- ☆35Updated 3 years ago
- Redwood Research's transformer interpretability tools☆15Updated 3 years ago
- [NeurIPS 2023] Learning Transformer Programs☆162Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Updated 2 months ago
- ☆215Updated last month
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆135Updated 3 years ago
- Interpreting how transformers simulate agents performing RL tasks☆90Updated 2 years ago
- Learn online intrinsic rewards from LLM feedback☆45Updated last year
- ☆39Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆134Updated 2 years ago