Reimplementation of `Improving language models by retrieving from trillions of tokens`
☆19Nov 16, 2022Updated 3 years ago
Alternatives and similar repositories for MinRETRO
Users that are interested in MinRETRO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jul 16, 2024Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Python package for Geometric / Clifford Algebra with Pytorch.☆14Jan 25, 2026Updated 2 months ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Cha…☆11Apr 16, 2021Updated 4 years ago
- The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".☆12Oct 18, 2021Updated 4 years ago
- A proof-of-concept implementation of Titans: models mixing long-term, short-term and persistent memories☆24Apr 9, 2025Updated 11 months ago
- ARC Community Project☆22Aug 2, 2024Updated last year
- Concept Learning Dynamics☆16Oct 29, 2024Updated last year
- Like ARC, but code to generate visual puzzles. 1D puzzles first.☆22Aug 17, 2024Updated last year
- Repository for Sparse Universal Transformers☆20Oct 23, 2023Updated 2 years ago
- Benchmark scripts for comparing tutorials in PyTorch and JAX☆14Aug 25, 2022Updated 3 years ago
- Code for ACL2022 publication Transkimmer: Transformer Learns to Layer-wise Skim☆22Aug 21, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence☆25Jul 31, 2025Updated 7 months ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]☆16May 17, 2023Updated 2 years ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆27Jun 4, 2024Updated last year
- ☆14Nov 23, 2020Updated 5 years ago
- ☆13Jan 20, 2023Updated 3 years ago
- Paper Implementation for "Parameter-Efficient Transfer Learning for NLP"☆17Aug 28, 2023Updated 2 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 5 months ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server☆57Feb 20, 2026Updated last month
- Analysis of Russian mass media articles about internet regulation with structural topic modeling☆11May 15, 2018Updated 7 years ago
- A Python package for analysis of geometric morphometric data.☆10Jun 7, 2018Updated 7 years ago
- ☆19Jul 17, 2019Updated 6 years ago
- ☆10Jun 27, 2024Updated last year
- Karras et al. (2022) diffusion models for PyTorch☆18May 6, 2024Updated last year
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆36Apr 18, 2025Updated 11 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆17Apr 22, 2025Updated 11 months ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems.☆19Aug 4, 2023Updated 2 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 7 months ago
- PyTorch interface for TrueGrad Optimizers☆43Aug 8, 2023Updated 2 years ago
- A TensorFlow [2.0] implementation of ProSeNet: "Interpretable and Steerable Sequence Learning via Prototypes" (Ming et al., 2019)☆12Dec 19, 2019Updated 6 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆33Aug 18, 2023Updated 2 years ago
- ☆12Jun 21, 2022Updated 3 years ago
- A scalable benchmark for state representation learning in visual reinforcement learning.☆17Jun 23, 2025Updated 9 months ago