alxndrTL / ARC_LLMsLinks

Evaluating majors LLMs on the Abstraction and Reasoning Corpus

☆15

Alternatives and similar repositories for ARC_LLMs

Users that are interested in ARC_LLMs are comparing it to the libraries listed below

Sorting:

Aleph-Alpha-Research / trigrams
☆56Updated 2 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
IBM / abductive-rule-learner-with-context-awareness
ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices.
☆18Updated 2 months ago
joey00072 / microjax
Jax like function transformation engine but micro, microjax
☆33Updated 8 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 2 months ago
epfml / DenseFormer
☆81Updated last year
latticetower / kaggle-arc
my solution for Abstaction and reasoning challenge on kaggle
☆10Updated last year
lucidrains / memory-editable-transformer
My explorations into editing the knowledge and memories of an attention network
☆35Updated 2 years ago
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated 7 months ago
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintable TPU-Training
☆50Updated last year
dvruette / barrel-rec-pytorch
☆53Updated last year
huggingface / peft-pytorch-conference
Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…
☆14Updated last year
davisyoshida / easy-lora-and-gptq
JAX notebook showing how to LoRA + GPTQ arbitrary models
☆10Updated last year
tanchongmin / ARC-Challenge
☆27Updated 10 months ago
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆38Updated last month
NousResearch / StripedHyenaTrainer
☆61Updated last year
google-deepmind / asyncdiloco
☆45Updated last year
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47Updated 2 years ago
Zyphra / Zyda_processing
☆36Updated last year
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆101Updated 7 months ago
apple / ml-hypercloning
☆48Updated 8 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆33Updated 5 months ago
sap-ient-ai / FFF
FastFeedForward Networks
☆20Updated last year
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆19Updated last year
arcee-ai / DAM
☆53Updated 8 months ago
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆46Updated last year
srush / drop7
☆18Updated last year
LucasPrietoAl / grokking-at-the-edge-of-numerical-stability
☆98Updated 6 months ago
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆85Updated last year
EleutherAI / training-jacobian
☆23Updated 7 months ago