facebookresearch / GCD
Computing the greatest common divisor with transformers, source code for the paper https://arxiv.org/abs/2308.15594
☆15 Updated last year
Alternatives and similar repositories for GCD
Users who are interested in GCD are comparing it to the repositories listed below.
- ☆18 Updated last year
- Code for the paper "Function-Space Learning Rates" ☆20 Updated 3 weeks ago
- ☆13 Updated last week
- ☆13 Updated last month
- ☆32 Updated 8 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 Updated 4 months ago
- ☆9 Updated 2 months ago
- We study toy models of skill learning. ☆28 Updated 5 months ago
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model ☆21 Updated 8 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 Updated last year
- Repo for solving ARC problems with a Neural Cellular Automaton ☆17 Updated last month
- A simple hypernetwork implementation in jax using haiku. ☆23 Updated 2 years ago
- Efficient Scaling laws and collaborative pretraining. ☆16 Updated 5 months ago
- ☆16 Updated last year
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆27 Updated last month
- ☆23 Updated 6 months ago
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task. ☆44 Updated 4 months ago
- Learn online intrinsic rewards from LLM feedback ☆41 Updated 6 months ago
- Implementation of Spectral State Space Models ☆16 Updated last year
- ☆11 Updated last year
- ☆23 Updated 8 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ☆30 Updated 3 weeks ago
- Fast reinforcement learning 💨 ☆24 Updated 3 months ago
- ☆21 Updated 9 months ago
- Experiments on the impact of depth in transformers and SSMs. ☆31 Updated 7 months ago
- ☆32 Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆27 Updated last year
- ☆28 Updated last year
- implementation of dualformer ☆17 Updated 3 months ago
- 🧮 Algebraic Positional Encodings. ☆14 Updated 5 months ago