aolabsai / OpenReason
an open source dataset and generation pipeline for Large-Scale Reinforcement Learning
☆17 Updated 9 months ago
Alternatives and similar repositories for OpenReason
Users who are interested in OpenReason are comparing it to the repositories listed below
- A template gymnasium environment for users to build upon ☆22 Updated last year
- Interpretability dashboard for reinforcement learners ☆16 Updated 6 years ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban ☆16 Updated 7 months ago
- ☆105 Updated 6 months ago
- ☆28 Updated 2 months ago
- ☆238 Updated 2 months ago
- Causal DAG Extraction from Text (DEFT) ☆66 Updated last year
- The history files when recording human interaction while solving ARC tasks ☆117 Updated last week
- Exercises of the reinforcement learning course from Hugging Face ☆13 Updated 2 years ago
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers ☆114 Updated last year
- Repo for the paper on Escalation Risks of AI systems ☆44 Updated last year
- Partially Observable Multi-Agent RL with Transformers ☆17 Updated this week
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their… ☆160 Updated 3 months ago
- Open Source version of SigOpt API, performing hyperparameter optimization and visualization ☆42 Updated 11 months ago
- Interpret text data with LLMs (sklearn compatible). ☆176 Updated last week
- Public repository containing METR's DVC pipeline for eval data analysis ☆189 Updated this week
- Benchmark for LLMs playing full press diplomacy ☆56 Updated 10 months ago
- My writings about ARC (Abstraction and Reasoning Corpus) ☆90 Updated last month
- Automating enterprise workflows with multimodal agents ☆115 Updated last year
- Create an AI capable of solving reasoning tasks it has never seen before ☆96 Updated last year
- ☆133 Updated 3 months ago
- ☆161 Updated last year
- ☆134 Updated last year
- ☆67 Updated 6 months ago
- Exca - Execution and caching tool for python ☆113 Updated last week
- The Foundation Model Transparency Index ☆85 Updated last month
- Code for 1st place solution to Kaggle's Abstraction and Reasoning Challenge ☆163 Updated 6 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated last year
- Draw more samples ☆198 Updated last year
- ☆125 Updated last year