khalil-research / 1D-ARC
☆24Updated last year
Alternatives and similar repositories for 1D-ARC:
Users that are interested in 1D-ARC are comparing it to the libraries listed below
- ☆29Updated last year
- Bootstrapping ARC☆104Updated 4 months ago
- Materials for ConceptARC paper☆89Updated 4 months ago
- Language-annotated Abstraction and Reasoning Corpus☆83Updated last year
- ☆18Updated 3 weeks ago
- ☆82Updated 8 months ago
- Abstract Reasoning with Graph Abstractions (ARGA) implementation☆60Updated 8 months ago
- ☆26Updated 6 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆145Updated 4 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆121Updated last week
- Multiple datasets for ARC (Abstraction and Reasoning Corpus)☆53Updated last month
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)☆64Updated 6 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆142Updated last month
- maze datasets for investigating OOD behavior of ML systems☆37Updated this week
- ☆9Updated 2 months ago
- ☆22Updated 11 months ago
- ☆34Updated 11 months ago
- Our solution for the arc challenge 2024☆112Updated 3 weeks ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆64Updated 2 years ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆73Updated last year
- ☆33Updated last year
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆44Updated last month
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆205Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆103Updated last year
- ☆96Updated 8 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆37Updated 3 months ago
- A library for efficient patching and automatic circuit discovery.☆59Updated last month
- Interpreting how transformers simulate agents performing RL tasks☆78Updated last year
- ☆173Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆119Updated 6 months ago