Reimplementation of `Improving language models by retrieving from trillions of tokens`
☆19Nov 16, 2022Updated 3 years ago
Alternatives and similar repositories for MinRETRO
Users that are interested in MinRETRO are comparing it to the libraries listed below.
- ☆16Jul 16, 2024Updated last year
- HomebrewNLP in JAX flavour for maintainable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Python package for Geometric / Clifford Algebra with Pytorch.☆14Jan 25, 2026Updated 2 months ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Cha…☆11Apr 16, 2021Updated 4 years ago
- The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".☆12Oct 18, 2021Updated 4 years ago
- A proof-of-concept implementation of Titans: models mixing long-term, short-term and persistent memories☆24Apr 9, 2025Updated last year
- [NAACL'25] Evaluating LLMs for Causal Queries☆13Feb 18, 2025Updated last year
- ARC Community Project☆22Aug 2, 2024Updated last year
- Concept Learning Dynamics☆16Oct 29, 2024Updated last year
- A simple attempt at Ladder Side-Tuning on CLUE☆22Jun 20, 2022Updated 3 years ago
- ⚡ Triton implementation of Clifford algebra neural networks.☆37Oct 24, 2025Updated 5 months ago
- Repository for Sparse Universal Transformers☆20Oct 23, 2023Updated 2 years ago
- Like ARC, but code to generate visual puzzles. 1D puzzles first.☆23Aug 17, 2024Updated last year
- https://arxiv.org/pdf/2506.06677☆51Nov 10, 2025Updated 5 months ago
- Implementation of the paper by Google, Transformer Memory As A Differentiable Search Index☆16May 27, 2022Updated 3 years ago
- Code for ACL2022 publication Transkimmer: Transformer Learns to Layer-wise Skim☆22Aug 21, 2022Updated 3 years ago
- Collection of review materials for the SJTU Marxism in China course, Fall 2024☆24Feb 19, 2026Updated last month
- Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence☆28Jul 31, 2025Updated 8 months ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]☆16May 17, 2023Updated 2 years ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆27Jun 4, 2024Updated last year
- Collection of review materials for the SJTU Dialectics of Nature course, Fall 2024☆26Feb 19, 2026Updated last month
- Official code for the paper `Neural Algorithmic Reasoning for Combinatorial Optimisation`☆21Apr 3, 2026Updated last week
- ☆13Jan 20, 2023Updated 3 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 6 months ago
- Paper Implementation for "Parameter-Efficient Transfer Learning for NLP"☆17Aug 28, 2023Updated 2 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server☆57Feb 20, 2026Updated last month
- ☆27Feb 10, 2022Updated 4 years ago
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- Analysis of Russian mass media articles about internet regulation with structural topic modeling☆11May 15, 2018Updated 7 years ago
- Client library for MindStudio AI Workers☆23Apr 24, 2025Updated 11 months ago
- ☆19Jul 17, 2019Updated 6 years ago
- ☆10Jun 27, 2024Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆17Apr 22, 2025Updated 11 months ago
- Karras et al. (2022) diffusion models for PyTorch☆18May 6, 2024Updated last year