jfpuget / ARC-AGI-Challenge-2024
☆46Updated last month
Alternatives and similar repositories for ARC-AGI-Challenge-2024:
Users that are interested in ARC-AGI-Challenge-2024 are comparing it to the libraries listed below
- Collection of autoregressive model implementation☆76Updated last week
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆40Updated this week
- ☆53Updated last year
- ☆78Updated 9 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆48Updated 5 months ago
- Official implementation of "BERTs are Generative In-Context Learners"☆23Updated 7 months ago
- supporting pytorch FSDP for optimizers☆75Updated last month
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆95Updated 3 weeks ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆77Updated 6 months ago
- Implementation of Infini-Transformer in Pytorch☆107Updated 2 weeks ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆54Updated 4 months ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆87Updated 7 months ago
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 2 weeks ago
- This is a port of Mistral-7B model in JAX☆30Updated 6 months ago
- ☆146Updated last month
- Explorations into the recently proposed Taylor Series Linear Attention☆91Updated 5 months ago
- ☆37Updated 9 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆44Updated 3 months ago
- ☆75Updated 6 months ago
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆75Updated this week
- An introduction to LLM Sampling☆75Updated last month
- DeMo: Decoupled Momentum Optimization☆170Updated last month
- Latent Diffusion Language Models☆68Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆36Updated 3 months ago
- ☆20Updated 3 months ago
- Utilities for PyTorch distributed☆23Updated last year
- ☆56Updated last week
- ☆49Updated 4 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research☆85Updated 2 months ago