rgreenblatt / arc_draw_more_samples_pub
Draw more samples
☆186Updated 9 months ago
Alternatives and similar repositories for arc_draw_more_samples_pub:
Users that are interested in arc_draw_more_samples_pub are comparing it to the libraries listed below
- Bootstrapping ARC☆105Updated 4 months ago
- The history files when recording human interaction while solving ARC tasks☆103Updated this week
- Reverse Engineering the Abstraction and Reasoning Corpus☆246Updated last month
- Domain Specific Language for the Abstraction and Reasoning Corpus☆248Updated 5 months ago
- Extract full next-token probabilities via language model APIs☆237Updated last year
- ☆106Updated 3 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆101Updated this week
- Our solution for the arc challenge 2024☆119Updated last month
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆186Updated 4 months ago
- METR Task Standard☆146Updated last month
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆169Updated this week
- Multiple datasets for ARC (Abstraction and Reasoning Corpus)☆57Updated this week
- ☆67Updated 2 months ago
- ☆87Updated 2 weeks ago
- Repository for the paper Stream of Search: Learning to Search in Language☆142Updated 2 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆76Updated 3 weeks ago
- Testing baseline LLMs performance across various models☆244Updated last week
- Long context evaluation for large language models☆201Updated 3 weeks ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆168Updated 2 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆302Updated 4 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆288Updated 3 weeks ago
- ☆92Updated last year
- smol models are fun too☆91Updated 4 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆187Updated 10 months ago
- Create an AI capable of solving reasoning tasks it has never seen before☆43Updated 3 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆166Updated 3 weeks ago
- Simple Transformer in Jax☆136Updated 9 months ago
- smolLM with Entropix sampler on pytorch☆151Updated 5 months ago
- ☆124Updated last week
- Verdict is a library for scaling judge-time compute.☆190Updated 2 weeks ago