PyTorch implementation of Titans.
☆37Jan 20, 2025Updated last year
Alternatives and similar repositories for Titans-PyTorch
Users that are interested in Titans-PyTorch are comparing it to the libraries listed below.
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch☆1,965Jun 6, 2026Updated 3 weeks ago
- This repository contains the code for the perspective paper "Multimodal Neural Databases" accepted at SIGIR 2023.☆20Nov 19, 2024Updated last year
- ☆27Sep 11, 2024Updated last year
- The repository contains code for Adaptive Data Optimization☆36Dec 9, 2024Updated last year
- Mixture of Experts from scratch☆14Apr 12, 2024Updated 2 years ago
- Multi-agent demo platform for Titans (arXiv:2501.00663) — neural networks that learn to memorize at test time. 7 AI agents, native deskto…☆412Jun 15, 2026Updated 2 weeks ago
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆16Dec 11, 2023Updated 2 years ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆30Dec 18, 2024Updated last year
- Rust derive macros for automating the boring stuff.☆14Aug 3, 2025Updated 11 months ago
- Official PyTorch implementation for "TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors" [ACL 2026]☆47Apr 14, 2026Updated 2 months ago
- This is the implementation of Cross-attention inspired Mamba.☆41Apr 5, 2025Updated last year
- [ICML 2026] Esoteric Language Models☆120Updated this week
- Bleeding edge low level Rust binding for GGML☆17Jun 26, 2024Updated 2 years ago
- ☆11Dec 26, 2018Updated 7 years ago
- RADIX-4 SRT division☆12Oct 31, 2019Updated 6 years ago
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆33Jan 3, 2026Updated 6 months ago
- 2027 entry-level data science & ML jobs — analytics, AI, quant & machine learning US roles☆47Jun 27, 2026Updated last week
- Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization (IEEE TPAMI 2021)☆17Jun 4, 2021Updated 5 years ago
- Implementation of Data2Vec 2.0 for vision☆18Feb 6, 2023Updated 3 years ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆185May 13, 2026Updated last month
- Almost SOTA LLM architecture, with O(n) time complexity☆11Jan 19, 2025Updated last year
- This project implements the Titans architecture from the paper "Titans: Learning to Memorize at Test Time" for market data prediction.☆10Jan 19, 2025Updated last year
- Implementation of a transformer for reinforcement learning using `x-transformers`☆73Sep 25, 2025Updated 9 months ago
- ☆13Jun 4, 2024Updated 2 years ago
- A browser based CadQuery server☆13Feb 18, 2025Updated last year
- An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more☆12May 29, 2026Updated last month
- TypeScript AI "code mode" toolkit with permissions and search☆65May 1, 2026Updated 2 months ago
- JPEG encoder and decoder implemented from scratch (Python JPEG codec)☆10Jul 29, 2022Updated 3 years ago
- ☆17Jun 28, 2026Updated last week
- Official repo of paper LM2☆48Feb 13, 2025Updated last year
- ☆43Dec 15, 2025Updated 6 months ago
- Basic floating-point components for RISC-V processors☆12Aug 13, 2017Updated 8 years ago
- An implementation of run-length encoding for PyTorch tensors using CUDA☆14Apr 7, 2021Updated 5 years ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆40Dec 31, 2024Updated last year
- Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.☆13Jun 5, 2024Updated 2 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Log-structured merge-tree implementation in Rust☆19Nov 6, 2018Updated 7 years ago
- Unsigned radix-2 SRT division☆17May 12, 2015Updated 11 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆16Apr 30, 2025Updated last year