alxndrTL / ARC_LLMs
Evaluating majors LLMs on the Abstraction and Reasoning Corpus
☆16Updated last year
Alternatives and similar repositories for ARC_LLMs:
Users that are interested in ARC_LLMs are comparing it to the libraries listed below
- ☆23Updated last year
- ☆54Updated 7 months ago
- ☆53Updated last year
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- Collection of autoregressive model implementation☆85Updated 2 months ago
- ☆27Updated 7 months ago
- Jax like function transformation engine but micro, microjax☆30Updated 6 months ago
- ☆48Updated 5 months ago
- ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices.☆18Updated last month
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 10 months ago
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆40Updated 2 months ago
- Latent Large Language Models☆17Updated 8 months ago
- Create an AI capable of solving reasoning tasks it has never seen before☆61Updated 4 months ago
- ☆26Updated 10 months ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆19Updated last year
- My explorations into editing the knowledge and memories of an attention network☆34Updated 2 years ago
- ☆20Updated last year
- ☆16Updated 2 months ago
- ☆43Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- ☆27Updated 9 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆32Updated 5 months ago
- ☆18Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆37Updated last year
- Triton Implementation of HyperAttention Algorithm☆47Updated last year
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- ☆33Updated 10 months ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated 10 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated last year