MohamedOsman1998 / deep-learning-for-arcLinks

☆13

Alternatives and similar repositories for deep-learning-for-arc

Users that are interested in deep-learning-for-arc are comparing it to the libraries listed below

Sorting:

xjdr-alt / muzero_sketch
☆38Updated 11 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆46Updated 3 months ago
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆21Updated 7 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆63Updated this week
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 8 months ago
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆112Updated this week
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆82Updated 3 weeks ago
haizelabs / thorn-in-haizestack
Thorn in a HaizeStack test for evaluating long-context adversarial robustness.
☆26Updated 10 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆33Updated 4 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆80Updated last month
Ziems / arbor
A framework for optimizing DSPy programs with RL
☆76Updated last week
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 4 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101Updated 3 months ago
agemoai / arcsolver
A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.
☆22Updated 5 months ago
clement-bonnet / lpn
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆88Updated 3 months ago
open-thought / reasoning-gym-eval
Collection of LLM completions for reasoning-gym task datasets
☆24Updated last month
jerber / lang-jepa
☆114Updated 6 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆54Updated 4 months ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆137Updated last year
ironbar / arc24
Create an AI capable of solving reasoning tasks it has never seen before
☆78Updated 6 months ago
naklecha / arc-agi-attempts
In this repository I have a code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :)
☆23Updated 7 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆78Updated 6 months ago
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21Updated last month
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated last month
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆66Updated 2 months ago
LeonGuertler / UnstableBaselines
☆28Updated this week
brendanhogan / picoDeepResearch
☆63Updated last month
doomslide / attention-graph
A graph visualization of attention
☆56Updated last month
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆46Updated 2 months ago