alxndrTL / ARC_LLMsLinks
Evaluating majors LLMs on the Abstraction and Reasoning Corpus
☆17Updated 2 years ago
Alternatives and similar repositories for ARC_LLMs
Users that are interested in ARC_LLMs are comparing it to the libraries listed below
Sorting:
- ☆30Updated last year
- ☆53Updated 2 years ago
- ☆59Updated 2 months ago
- ☆82Updated last year
- Collection of autoregressive model implementation☆85Updated last month
- Create an AI capable of solving reasoning tasks it has never seen before☆96Updated last year
- ARC gym: a data generation framework for the Abstraction & Reasoning Corpus☆25Updated last week
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.☆25Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆38Updated 8 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Updated 2 months ago
- Simple GRPO scripts and configurations.☆59Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆86Updated 2 years ago
- Materials for ConceptARC paper☆114Updated this week
- Token Omission Via Attention☆128Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆183Updated 3 months ago
- ☆208Updated 3 weeks ago
- ☆27Updated last year
- ☆47Updated 2 years ago
- ☆32Updated 2 years ago
- Code repository for the c-BTM paper☆108Updated 2 years ago
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)☆69Updated last year
- ☆94Updated 2 years ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- LLM training in simple, raw C/CUDA☆15Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated 2 years ago
- Multi-Domain Expert Learning☆67Updated 2 years ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- some common Huggingface transformers in maximal update parametrization (µP)☆87Updated 3 years ago
- Multiple datasets for ARC (Abstraction and Reasoning Corpus)☆87Updated 10 months ago