alxndrTL / ARC_LLMsLinks
Evaluating majors LLMs on the Abstraction and Reasoning Corpus
☆15Updated last year
Alternatives and similar repositories for ARC_LLMs
Users that are interested in ARC_LLMs are comparing it to the libraries listed below
Sorting:
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- An MDL-based approach to the Abstraction and Reasoning Corpus (ARC) challenge☆18Updated last month
- ☆27Updated 10 months ago
- ☆23Updated last year
- ☆49Updated last year
- ☆55Updated last month
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- ☆49Updated 7 months ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆37Updated last year
- Jax like function transformation engine but micro, microjax☆32Updated 7 months ago
- ☆53Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆66Updated last month
- Collection of autoregressive model implementation☆85Updated last month
- LLM training in simple, raw C/CUDA☆14Updated 6 months ago
- Simple repository for training small reasoning models☆31Updated 4 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated last week
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- Residual Quantization Autoencoder, used for interpreting LLMs☆12Updated 5 months ago
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.☆22Updated 5 months ago
- ☆34Updated 11 months ago
- My explorations into editing the knowledge and memories of an attention network☆35Updated 2 years ago
- ☆68Updated 9 months ago
- Collection of LLM completions for reasoning-gym task datasets☆23Updated 2 weeks ago
- ☆54Updated 6 months ago
- Latent Large Language Models☆18Updated 9 months ago
- ☆27Updated 9 months ago
- ☆22Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- An introduction to LLM Sampling☆78Updated 5 months ago