Reimplementation of `Improving language models by retrieving from trillions of tokens`
☆19Nov 16, 2022Updated 3 years ago
Alternatives and similar repositories for MinRETRO
Users that are interested in MinRETRO are comparing it to the libraries listed below
Sorting:
- ☆16Jul 16, 2024Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Cha…☆11Apr 16, 2021Updated 4 years ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- Real-Time RTUs☆11Jan 2, 2025Updated last year
- [NAACL'25] Evaluating LLMs for Causal Queries☆13Feb 18, 2025Updated last year
- ☆14Nov 23, 2020Updated 5 years ago
- Vintix: Action Model via In-Context Reinforcement Learning - - —☆22May 23, 2025Updated 9 months ago
- Network representation learning on drug-target-side effects-indication graphs for side effect prediction☆13Feb 4, 2020Updated 6 years ago
- A TensorFlow [2.0] implementation of ProSeNet: "Interpretable and Steerable Sequence Learning via Prototypes" (Ming et al., 2019)☆12Dec 19, 2019Updated 6 years ago
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- ☆10Jun 27, 2024Updated last year
- A scalable benchmark for state representation learning in visual reinforcement learning.☆16Jun 23, 2025Updated 8 months ago
- The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".☆11Oct 18, 2021Updated 4 years ago
- Analysis of Russian mass media articles about internet regulation with structural topic modeling☆11May 15, 2018Updated 7 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated 2 years ago
- Extracting minimal DFA's from well-trained RNN's☆11Nov 26, 2018Updated 7 years ago
- This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinfor…☆11Apr 3, 2025Updated 11 months ago
- ☆13Jan 20, 2023Updated 3 years ago
- ☆12Jun 21, 2022Updated 3 years ago
- iQRL: implicitly Quantized Representations for Sample-efficient Reinforcement Learning☆12Jan 8, 2025Updated last year
- Gym env for Slay the Spire☆16Dec 31, 2024Updated last year
- ☆13Apr 25, 2024Updated last year
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server☆56Feb 20, 2026Updated last week
- [ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…☆17Jun 12, 2025Updated 8 months ago
- Concept Learning Dynamics☆16Oct 29, 2024Updated last year
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems.☆19Aug 4, 2023Updated 2 years ago
- Pointax: PointMaze Environment for JAX☆26Oct 22, 2025Updated 4 months ago
- GULAG: GUessing LAnGuages with neural networks☆13May 4, 2022Updated 3 years ago
- ☆13May 21, 2023Updated 2 years ago
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆14Feb 3, 2023Updated 3 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture☆13Jun 22, 2022Updated 3 years ago
- ⚡ Triton implementation of Clifford algebra neural networks.☆35Oct 24, 2025Updated 4 months ago
- A scalable Dreamer implementation in JAX☆10May 22, 2022Updated 3 years ago
- Train, visualize, and evaluate RL policies for the Terra environment.☆18Feb 10, 2026Updated 3 weeks ago