Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers"
☆39Apr 8, 2023Updated 3 years ago
Alternatives and similar repositories for Looped-Transformer
Users that are interested in Looped-Transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆44Dec 12, 2023Updated 2 years ago
- ☆85Aug 31, 2023Updated 2 years ago
- ☆21Mar 1, 2023Updated 3 years ago
- ☆10Oct 28, 2024Updated last year
- The git repository of Modular Prompted Chatbot paper☆35May 24, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…☆14Oct 26, 2025Updated 7 months ago
- Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)☆13Aug 11, 2025Updated 9 months ago
- Created Francisco Angulo de Lafuente ⚡️Deploy the DEMO⬇️☆30May 8, 2026Updated 2 weeks ago
- ☆11Aug 21, 2023Updated 2 years ago
- KV cache compression via sparse coding☆17Oct 26, 2025Updated 6 months ago
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆15Jul 2, 2024Updated last year
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Jun 26, 2024Updated last year
- TDMS 2.0 support for F# and C#☆13Dec 26, 2022Updated 3 years ago
- Official repository of "Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models" [ICML 2023]☆24Jan 10, 2025Updated last year
- ☆17Oct 31, 2023Updated 2 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- ☆20May 5, 2023Updated 3 years ago
- Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"☆15Aug 2, 2025Updated 9 months ago
- ☆22Updated this week
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆22Jul 16, 2023Updated 2 years ago
- Legacy Code of ZJU Campus App for iOS☆11Jan 31, 2024Updated 2 years ago
- Code for Paper < >☆12Jul 16, 2024Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Nov 11, 2024Updated last year
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago
- JS[Public LB 26th] training code...☆12Jan 20, 2025Updated last year
- ☆92Aug 18, 2024Updated last year
- ☆18Jul 10, 2022Updated 3 years ago
- ☆26Jun 22, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"☆63Mar 23, 2026Updated 2 months ago
- ☆20Dec 5, 2025Updated 5 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- [DeepRead] This is the official implementation of the DeepRead paper.☆47May 1, 2026Updated 3 weeks ago
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs☆19Mar 20, 2025Updated last year
- ☆18Jul 20, 2025Updated 10 months ago
- Collect papers related to personalized text generation☆18Sep 6, 2021Updated 4 years ago