rgreenblatt / arc_draw_more_samples_pubLinks

Draw more samples

☆193

Alternatives and similar repositories for arc_draw_more_samples_pub

Users that are interested in arc_draw_more_samples_pub are comparing it to the libraries listed below

Sorting:

michaelhodel / re-arc
Reverse Engineering the Abstraction and Reasoning Corpus
☆291Updated 5 months ago
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247Updated last year
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆321Updated 8 months ago
michaelhodel / arc-dsl
Domain Specific Language for the Abstraction and Reasoning Corpus
☆285Updated 9 months ago
METR / RE-Bench
☆95Updated 3 months ago
xu3kev / BARC
Bootstrapping ARC
☆139Updated 8 months ago
jerber / lang-jepa
☆118Updated 7 months ago
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆114Updated last week
METR / public-tasks
☆99Updated 4 months ago
magicproduct / hash-hop
Long context evaluation for large language models
☆220Updated 5 months ago
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177Updated 3 weeks ago
OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆226Updated 2 weeks ago
da-fr / arc-prize-2024
Our solution for the arc challenge 2024
☆166Updated last month
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆157Updated 3 months ago
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆190Updated last year
LeonGuertler / TextArena
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
☆225Updated this week
srush / GPTWorld
A puzzle to learn about prompting
☆132Updated 2 years ago
METR / task-standard
METR Task Standard
☆156Updated 6 months ago
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆149Updated 6 months ago
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆207Updated 7 months ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138Updated last year
google-deepmind / mishax
☆136Updated 4 months ago
LeonGuertler / UnstableBaselines
☆96Updated last week
neoneye / arc-dataset-collection
Multiple datasets for ARC (Abstraction and Reasoning Corpus)
☆76Updated 4 months ago
TransluceAI / observatory
A toolkit for describing model features and intervening on those features to steer behavior.
☆195Updated 8 months ago
victorvikram / ConceptARC
Materials for ConceptARC paper
☆98Updated 9 months ago
hydrallm / llama-moe-v1
☆95Updated 2 years ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆221Updated 3 weeks ago
EleutherAI / concept-erasure
Erasing concepts from neural representations with provable guarantees
☆231Updated 6 months ago
mxbi / arckit
Tools for working with the Abstraction & Reasoning Corpus
☆196Updated 11 months ago