Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.
☆63Jan 30, 2025Updated last year
Alternatives and similar repositories for MemoryMosaics
Users that are interested in MemoryMosaics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its…☆21Sep 10, 2024Updated last year
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated 2 years ago
- computation of convolutional kernels (CKN and NTK) in C++☆14Dec 13, 2022Updated 3 years ago
- [Poster; ICLR 2026] [Oral; Neurips OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆16Apr 15, 2026Updated 2 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆31Nov 14, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A deep learning-powered visual navigation engine to enables autonomous navigation of pocket-size quadrotor - running on PULP☆13Oct 30, 2024Updated last year
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆13Oct 23, 2021Updated 4 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- Vortex: Programmable Sparse Attention for Agents as Algorithm Designers☆63Jun 24, 2026Updated last week
- ☆33Nov 11, 2024Updated last year
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Jun 6, 2024Updated 2 years ago
- Proximal Asynchronous SAGA☆13Nov 30, 2017Updated 8 years ago
- Code repo for the NeurIPS 2021 paper "Online Adaption to Label Distribution Shift".☆16Feb 15, 2023Updated 3 years ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆92Oct 30, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆43Feb 24, 2023Updated 3 years ago
- Code for the paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains☆11Nov 12, 2021Updated 4 years ago
- [ICLR 2021] "Robust Overfitting may be mitigated by properly learned smoothening" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, Shiyu Chan…☆49Dec 30, 2021Updated 4 years ago
- ☆11Mar 31, 2022Updated 4 years ago
- ☆17Dec 19, 2024Updated last year
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"☆98Jun 20, 2026Updated last week
- a benchmark to evaluate the situated inductive reasoning☆16Jan 7, 2025Updated last year
- The official Languini Kitchen repository☆14May 6, 2024Updated 2 years ago
- Stick-breaking attention☆63Jul 1, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- minimal Energy-based transformer☆44Dec 11, 2025Updated 6 months ago
- ☆32Mar 1, 2024Updated 2 years ago
- ☆23Sep 2, 2025Updated 9 months ago
- rules for writing and typesetting☆26Oct 1, 2021Updated 4 years ago
- A framework for deploying on-demand distributed-trust.☆14Jun 4, 2024Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- ☆10Aug 25, 2016Updated 9 years ago
- EQUATE (Evaluating Quantitative Understanding Aptitude in Textual Entailment), framework for evaluating quantitative reasoning ability in…☆14Feb 13, 2022Updated 4 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆41Oct 17, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Official implementation of "Traveling Waves Encode the Recent Past and Enhance Sequence Learning" (ICLR 2024)☆12Mar 15, 2024Updated 2 years ago
- V-MPO torch version with DMLab30 and GTrXL☆13Mar 1, 2021Updated 5 years ago
- ☆14Jun 24, 2024Updated 2 years ago
- Triton-based implementation of Sparse Mixture of Experts.☆278Oct 3, 2025Updated 8 months ago
- This is the implementation for the NeurIPS 2022 paper: ZIN: When and How to Learn Invariance Without Environment Partition?☆22Dec 3, 2022Updated 3 years ago
- Hybrid Discriminative-Generative Training via Contrastive Learning☆75May 1, 2023Updated 3 years ago
- Official code for Slot-Transformer for Videos (STEVE)☆51Jan 9, 2023Updated 3 years ago