jfpuget / ARC-AGI-Challenge-2024Links

☆56

Alternatives and similar repositories for ARC-AGI-Challenge-2024

Users that are interested in ARC-AGI-Challenge-2024 are comparing it to the libraries listed below

Sorting:

epfml / DenseFormer
☆81Updated last year
lucidrains / llama-qrlhf
Implementation of the Llama architecture with RLHF + Q-learning
☆167Updated 8 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆44Updated 8 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 6 months ago
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆102Updated 10 months ago
dvruette / barrel-rec-pytorch
☆53Updated last year
lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆57Updated 5 months ago
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆84Updated last year
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆147Updated 3 weeks ago
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆131Updated 10 months ago
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆68Updated last year
lucidrains / transformer-directed-evolution
Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster
☆71Updated 5 months ago
Aleph-Alpha-Research / trigrams
☆57Updated 3 weeks ago
LucasPrietoAl / grokking-at-the-edge-of-numerical-stability
☆102Updated 3 months ago
amirzandieh / HyperAttention
Triton Implementation of HyperAttention Algorithm
☆48Updated last year
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆132Updated last year
ml-jku / EVA
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆44Updated last week
lucidrains / GAF-microbatch-pytorch
Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch
☆25Updated 9 months ago
lucidrains / infini-transformer-pytorch
Implementation of Infini-Transformer in Pytorch
☆113Updated 9 months ago
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated 10 months ago
cloneofsimo / min-fsdp
☆91Updated last year
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆168Updated 4 months ago
shikaiqiu / compute-better-spent
☆58Updated last year
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆194Updated 10 months ago
joey00072 / microjax
Jax like function transformation engine but micro, microjax
☆33Updated last year
Think-a-Tron / evolve
open source alpha evolve
☆66Updated 5 months ago
tanaymeh / mamba-train
A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM
☆59Updated last year
NousResearch / StripedHyenaTrainer
☆61Updated last year
drisspg / transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
☆59Updated 2 weeks ago
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆193Updated last year