Reimplementation of `Improving language models by retrieving from trillions of tokens`
☆19 Nov 16, 2022 Updated 3 years ago
Alternatives and similar repositories for MinRETRO
Users that are interested in MinRETRO are comparing it to the libraries listed below.
- ☆16 Jul 16, 2024 Updated last year
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆51 Jan 20, 2024 Updated 2 years ago
- Python package for Geometric / Clifford Algebra with Pytorch. ☆15 Jun 2, 2026 Updated last week
- Automatically take good care of your preemptible TPUs ☆37 May 15, 2023 Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware. ☆67 Aug 4, 2022 Updated 3 years ago
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Cha… ☆11 Apr 16, 2021 Updated 5 years ago
- AI powered Virtual Desktop ☆16 Jun 7, 2026 Updated last week
- Network representation learning on drug-target-side effects-indication graphs for side effect prediction ☆13 Feb 4, 2020 Updated 6 years ago
- The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition". ☆12 Oct 18, 2021 Updated 4 years ago
- A proof-of-concept implementation of Titans: models mixing long-term, short-term and persistent memories ☆24 Apr 9, 2025 Updated last year
- [NAACL'25] Evaluating LLMs for Causal Queries ☆14 Feb 18, 2025 Updated last year
- ARC Community Project ☆23 Aug 2, 2024 Updated last year
- Concept Learning Dynamics ☆16 Oct 29, 2024 Updated last year
- Benchmark scripts for comparing tutorials in PyTorch and JAX ☆14 Aug 25, 2022 Updated 3 years ago
- Like ARC, but code to generate visual puzzles. 1D puzzles first. ☆23 Aug 17, 2024 Updated last year
- A repository for paper Joint Embedding Predictive Architectures Focus on Slow Features ☆27 Oct 27, 2022 Updated 3 years ago
- Code for ACL2022 publication Transkimmer: Transformer Learns to Layer-wise Skim ☆22 Aug 21, 2022 Updated 3 years ago
- Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence ☆32 Jul 31, 2025 Updated 10 months ago
- Implementations of Transformers for Video ☆24 Mar 26, 2021 Updated 5 years ago
- MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023] ☆16 May 17, 2023 Updated 3 years ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network" ☆29 Jun 4, 2024 Updated 2 years ago
- ☆14 Nov 23, 2020 Updated 5 years ago
- ☆10 Dec 10, 2024 Updated last year
- ☆13 Jan 20, 2023 Updated 3 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso… ☆18 Oct 9, 2025 Updated 8 months ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O… ☆12 Jan 31, 2023 Updated 3 years ago
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server ☆57 Apr 21, 2026 Updated last month
- ☆27 Feb 10, 2022 Updated 4 years ago
- Client library for MindStudio AI Workers ☆22 Apr 24, 2025 Updated last year
- A Python package for analysis of geometric morphometric data. ☆10 Jun 7, 2018 Updated 8 years ago
- A lightweight and real-time DETR for aerial images detection ☆48 Mar 22, 2025 Updated last year
- ☆10 Jun 27, 2024 Updated last year
- Karras et al. (2022) diffusion models for PyTorch ☆18 May 6, 2024 Updated 2 years ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers ☆36 Apr 18, 2025 Updated last year
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R… ☆12 Feb 23, 2024 Updated 2 years ago
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems. ☆19 Aug 4, 2023 Updated 2 years ago
- PyTorch interface for TrueGrad Optimizers ☆43 Aug 8, 2023 Updated 2 years ago
- A TensorFlow [2.0] implementation of ProSeNet: "Interpretable and Steerable Sequence Learning via Prototypes" (Ming et al., 2019) ☆13 Dec 19, 2019 Updated 6 years ago
- ☆12 Jun 21, 2022 Updated 3 years ago