Leiay / looped_transformerView external linksLinks
☆35Dec 12, 2023Updated 2 years ago
Alternatives and similar repositories for looped_transformer
Users that are interested in looped_transformer are comparing it to the libraries listed below
Sorting:
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆30Apr 8, 2023Updated 2 years ago
- Generative Equilibrium Transformer☆27Nov 11, 2023Updated 2 years ago
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…☆14Oct 26, 2025Updated 3 months ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- ☆23Updated this week
- ☆11Jun 29, 2021Updated 4 years ago
- Code accompanying the paper "A contrastive rule for meta-learning"☆13Oct 31, 2024Updated last year
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆34Jan 16, 2026Updated last month
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆17Sep 8, 2022Updated 3 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- Mamba support for transformer lens☆19Sep 17, 2024Updated last year
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- ☆20Mar 1, 2023Updated 2 years ago
- Official repo of paper LM2☆47Feb 13, 2025Updated last year
- ☆45Apr 30, 2018Updated 7 years ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆56Mar 10, 2025Updated 11 months ago
- Manually implemented quantization-aware training☆23Oct 12, 2022Updated 3 years ago
- benchmarking some transformer deployments☆26Dec 15, 2025Updated 2 months ago
- ☆27Feb 1, 2023Updated 3 years ago
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆28Mar 19, 2019Updated 6 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- Educational verilog library that supports IEEE754 floating point arithmetic with a parametrizable mantissa and exponent☆32Mar 13, 2025Updated 11 months ago
- Official Code Repository for the paper "Key-value memory in the brain"☆31Feb 25, 2025Updated 11 months ago
- Official Repositiory for Spherical Voronoi: Directional Appearance as a Differentiable Partition of the Sphere☆70Jan 29, 2026Updated 2 weeks ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆198May 28, 2024Updated last year
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- ☆35Apr 12, 2024Updated last year
- A Learnable LSH Framework for Efficient NN Training☆34Jul 22, 2021Updated 4 years ago
- BitLinear implementation☆35Jan 1, 2026Updated last month
- Wrappers for open source FPU hardware implementations.☆37Nov 27, 2025Updated 2 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆89Oct 30, 2024Updated last year
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Mar 14, 2024Updated last year
- ☆32Oct 31, 2024Updated last year
- Learning Universal Predictors☆81Aug 1, 2024Updated last year
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆40Jun 22, 2024Updated last year
- rebuilds and completes models of protein complexes using AlphaFold2☆15Jan 22, 2026Updated 3 weeks ago
- Official code for `Visual Attention Emerges from Recurrent Sparse Reconstruction' (ICML 2022)☆36Jul 5, 2022Updated 3 years ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 4 years ago
- CVE-Factory☆46Updated this week