aolabsai / OpenReasonLinks
an open source dataset and generation pipeline for Large-Scale Reinforcement Learning
☆17Updated 9 months ago
Alternatives and similar repositories for OpenReason
Users that are interested in OpenReason are comparing it to the libraries listed below
Sorting:
- Interpretability dashboard for reinforcement learners☆16Updated 6 years ago
- A template gymnasium environment for users to build upon☆22Updated last year
- Enjoy puzzle-solving directly in your browser.☆32Updated 9 months ago
- My writings about ARC (Abstraction and Reasoning Corpus)☆90Updated last month
- ☆107Updated this week
- The history files when recording human interaction while solving ARC tasks☆117Updated 2 weeks ago
- A clean no-jargon mathematical definition of transforrmer language model with a Python implementation that focuses on clarity rather than…☆11Updated 3 years ago
- General-Sum variant of the game Diplomacy for evaluating AIs.☆34Updated last year
- Benchmark for LLMs playing full press diplomacy☆56Updated 11 months ago
- Causal DAG Extraction from Text (DEFT)☆66Updated last year
- ☆22Updated last year
- ☆40Updated 3 weeks ago
- ☆239Updated 2 months ago
- ☆28Updated 2 months ago
- Partially Observable Multi-Agent RL with Transformers☆17Updated last week
- Draw more samples☆198Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their…☆160Updated 3 months ago
- Code for minimum-entropy coupling.☆32Updated last month
- gzip Predicts Data-dependent Scaling Laws☆34Updated last year
- ☆42Updated 2 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated last year
- The Foundation Model Transparency Index☆85Updated last month
- Open-source Human Feedback Library☆11Updated 2 years ago
- Our solution for the arc challenge 2024☆188Updated 7 months ago
- Tools for working with the Abstraction & Reasoning Corpus☆215Updated 5 months ago
- A repo to evaluate various LLM's chess playing abilities.☆87Updated last year
- ☆134Updated last year
- ☆22Updated last year
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆17Updated 7 months ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…☆216Updated 3 weeks ago