Reimplementation of `Improving language models by retrieving from trillions of tokens`
☆19Nov 16, 2022Updated 3 years ago
Alternatives and similar repositories for MinRETRO
Users that are interested in MinRETRO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jul 16, 2024Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Jan 20, 2024Updated 2 years ago
- Python package for Geometric / Clifford Algebra with Pytorch.☆16Jun 2, 2026Updated last month
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware.☆67Aug 4, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Cha…☆11Apr 16, 2021Updated 5 years ago
- Network representation learning on drug-target-side effects-indication graphs for side effect prediction☆13Feb 4, 2020Updated 6 years ago
- The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".☆12Oct 18, 2021Updated 4 years ago
- A proof-of-concept implementation of Titans: models mixing long-term, short-term and persistent memories☆24Apr 9, 2025Updated last year
- ARC Community Project☆23Aug 2, 2024Updated last year
- Concept Learning Dynamics☆16Oct 29, 2024Updated last year
- ⚡ Triton implementation of Clifford algebra neural networks.☆42May 2, 2026Updated 2 months ago
- Repository for Sparse Universal Transformers☆20Oct 23, 2023Updated 2 years ago
- Benchmark scripts for comparing tutorials in PyTorch and JAX☆14Aug 25, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 3 years ago
- Implementations of Transformers for Video☆24Mar 26, 2021Updated 5 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]☆16May 17, 2023Updated 3 years ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆29Jun 4, 2024Updated 2 years ago
- ☆14Nov 23, 2020Updated 5 years ago
- Official code for the paper `Neural Algorithmic Reasoning for Combinatorial Optimisation`☆22Apr 3, 2026Updated 3 months ago
- ☆10Dec 10, 2024Updated last year
- ☆13Jan 20, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 8 months ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server☆57Apr 21, 2026Updated 2 months ago
- ☆27Feb 10, 2022Updated 4 years ago
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- Analysis of Russian mass media articles about internet regulation with structural topic modeling☆11May 15, 2018Updated 8 years ago
- Client library for MindStudio AI Workers☆22Apr 24, 2025Updated last year
- A Python package for analysis of geometric morphometric data.☆10Jun 7, 2018Updated 8 years ago
- ☆20Jul 17, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A lightweight and real-time DETR for aerial images detection☆48Mar 22, 2025Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆17Apr 22, 2025Updated last year
- Karras et al. (2022) diffusion models for PyTorch☆18May 6, 2024Updated 2 years ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆36Apr 18, 2025Updated last year
- ☆26Jul 15, 2021Updated 4 years ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated 2 years ago
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems.☆19Aug 4, 2023Updated 2 years ago