facebookresearch / GCD
Computing the greatest common divisor with transformers. Source code for the paper https://arxiv.org/abs/2308.15594
☆15 Updated last year
Alternatives and similar repositories for GCD
Users that are interested in GCD are comparing it to the libraries listed below
- ☆18 Updated last year
- ☆13 Updated this week
- ☆32 Updated last year
- Code for the paper "Function-Space Learning Rates" ☆20 Updated this week
- FlexAttention w/ FlashAttention3 Support ☆26 Updated 8 months ago
- Personal solutions to the Triton Puzzles ☆18 Updated 10 months ago
- JAX-like function transformation engine, but micro: microjax ☆32 Updated 7 months ago
- Causal Analysis of Agent Behavior for AI Safety ☆18 Updated last year
- Implementation of Spectral State Space Models ☆16 Updated last year
- Flexible meta-learning in JAX ☆14 Updated last year
- Training hybrid models for dummies. ☆21 Updated 4 months ago
- ☆29 Updated 6 months ago
- Learn online intrinsic rewards from LLM feedback ☆37 Updated 5 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆27 Updated last year
- Source-to-Source Debuggable Derivatives in Pure Python ☆15 Updated last year
- Official Code Repository for the paper "Key-value memory in the brain" ☆26 Updated 3 months ago
- [Oral; NeurIPS OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers ☆12 Updated 2 months ago
- Efficiently send large arrays across machines ☆16 Updated 10 months ago
- ☆32 Updated 8 months ago
- ☆22 Updated 7 months ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection ☆46 Updated 2 years ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ☆30 Updated last week
- This repo is based on https://github.com/jiaweizzhao/GaLore ☆28 Updated 8 months ago
- ☆19 Updated 3 weeks ago
- Experimental scripts for researching data adaptive learning rate scheduling. ☆23 Updated last year
- 🧮 Algebraic Positional Encodings. ☆13 Updated 5 months ago
- Repository of machine learning benchmarks ☆36 Updated this week
- A basic pure PyTorch implementation of flash attention ☆16 Updated 7 months ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf ☆19 Updated 10 months ago
- Repo for solving arc problems with Neural Cellular Automata ☆15 Updated 2 weeks ago