lucidrains/ultra-mem

Implementation of UltraMem, improved Product Key Memory design, from Bytedance AI labs

☆28

Alternatives and similar repositories for ultra-mem

Users that are interested in ultra-mem are comparing it to the libraries listed below.

lucidrains / strassen-attention
Implementation of Strassen attention, from Kozachinskiy et al. of National Center of AI in Chile
☆41 · Jul 8, 2025 · Updated last year
lucidrains / simplicial-attention
Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…
☆49 · Sep 2, 2025 · Updated 10 months ago
lucidrains / RIM-pytorch
Implementation of Recurrent Independent Mechanisms in Pytorch
☆27 · Apr 6, 2026 · Updated 3 months ago
lucidrains / hippoformer
Unofficial implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers
☆53 · Apr 28, 2026 · Updated 2 months ago
lucidrains / populora
Implementation and explorations into PopuLoRA, Co-Evolving LLM Populations for Reasoning Self-Play
☆15 · Jun 3, 2026 · Updated last month
lucidrains / lookahead-keys-attention
Causal Attention with Lookahead Keys
☆28 · Sep 26, 2025 · Updated 10 months ago
lucidrains / x-evolution
Implementation of various evolutionary algorithms, starting with evolutionary strategies
☆51 · May 10, 2026 · Updated 2 months ago
lucidrains / tiny-recursive-model
Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau
☆191 · Dec 23, 2025 · Updated 7 months ago
Debrup-61 / RaDeR
Official Code Repository for "RaDeR: Reasoning-aware Dense Retrieval Models" accepted at Main Conference EMNLP 2025
☆18 · Jun 23, 2025 · Updated last year
allbilly / ane
Run ops on Apple ANE in NPU register with pure python on M1 Asahi Linux. No Espresso, No CoreML, no metal, no .mlmodels file, no .hwx fil…
☆16 · Jun 28, 2026 · Updated 3 weeks ago
SRSWTI / shadows
a fast and lightweight distributed background task processing framework with seamless scheduling.
☆15 · Mar 30, 2026 · Updated 3 months ago
lucidrains / disco-rl-pytorch
Implementation and explorations into DiscoRL, Discovering state-of-the-art reinforcement learning algorithms, David Silver's last work at…
☆21 · Jun 13, 2026 · Updated last month
lucidrains / fast-weight-product-key-memory
Implementation of the fast weight product key memory from Sakana AI
☆19 · Apr 1, 2026 · Updated 3 months ago
stanford-oval / sliders
Repository for paper: Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets
☆27 · Apr 27, 2026 · Updated 3 months ago
lucidrains / poly-attention
Implementation of Poly-attention, a higher-order self-attention proposed by Chakrabarti et al. of Columbia
☆52 · Updated this week
lucidrains / evolutionary-policy-optimization
Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University
☆110 · May 18, 2026 · Updated 2 months ago
ars22 / e3
☆20 · Sep 16, 2025 · Updated 10 months ago
lucidrains / improving-transformers-world-model-for-rl
Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch
☆155 · May 2, 2025 · Updated last year
wu-kan / wuk_cupti_wrapper
a simple API to use CUPTI
☆10 · Aug 19, 2025 · Updated 11 months ago
lucidrains / multiscreen
Implementation of Multiscreen proposed by Ken Nakanishi for "Screening is Enough"
☆18 · May 13, 2026 · Updated 2 months ago
ASK-Berkeley / graph-free-transformer
☆16 · Feb 9, 2026 · Updated 5 months ago
neuraloperator / NNs-to-NOs
☆24 · May 21, 2026 · Updated 2 months ago
BurakGurbuz97 / NICE
NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning
☆29 · Jul 28, 2024 · Updated last year
zlab-princeton / llm-distillation-jax
JAX implementation of configurable LLM distillation training
☆24 · Nov 15, 2025 · Updated 8 months ago
lucidrains / adam-atan2-pytorch
Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch
☆143 · Jul 17, 2026 · Updated last week
ESHyperscale / HyperscaleES
Jax Codebase for Evolutionary Strategies at the Hyperscale
☆350 · Feb 27, 2026 · Updated 4 months ago
tim-lawson / skip-middle
Learning to Skip the Middle Layers of Transformers
☆17 · Aug 7, 2025 · Updated 11 months ago
lucidrains / PoPE-pytorch
Efficient implementation (and explorations) into polar coordinate positional embedding (PoPE) - from Gopalakrishnan et al. under Schmidhu…
☆71 · Jun 21, 2026 · Updated last month
lucidrains / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆17 · Apr 22, 2025 · Updated last year
apple / ml-pararnn
☆192 · Oct 31, 2025 · Updated 8 months ago
LaunchPlatform / marketplace
Marketplace ML experiment - training without backprop
☆28 · Sep 9, 2025 · Updated 10 months ago
EleutherAI / training-jacobian
☆24 · Dec 11, 2024 · Updated last year
letta-ai / recovery-bench
Recovery-Bench is a benchmark for evaluating the capability of LLM agents to recover from mistakes
☆27 · Jun 17, 2026 · Updated last month
camenduru / TANGO-jupyter
☆13 · Oct 14, 2024 · Updated last year
anadim / subleq-transformer
A transformer that executes a one-instruction Turing-complete computer — two approaches: hand-coded weights (no training) and learned fro…
☆41 · Mar 3, 2026 · Updated 4 months ago
kostarion / guided-diffusion
☆13 · Jun 7, 2023 · Updated 3 years ago
Hmbown / ZMLX
Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon
☆47 · Mar 31, 2026 · Updated 3 months ago
lucidrains / dreamer4
Implementation of Danijar's latest iteration for his Dreamer line of work
☆209 · Jul 4, 2026 · Updated 3 weeks ago
lucidrains / neat
Explorations into NEAT and some of its derivative research
☆41 · Jul 6, 2026 · Updated 3 weeks ago