Adaptation of titans-pytorch to llama models on HF
☆25 · Mar 6, 2025 · Updated last year
Alternatives and similar repositories for llama-titans
Users that are interested in llama-titans are comparing it to the libraries listed below.
- ☆13 · Apr 15, 2024 · Updated 2 years ago
- ☆18 · Mar 11, 2025 · Updated last year
- AI Based "Happiness Optimizer" ☆12 · Oct 20, 2024 · Updated last year
- Evaluate computational models on their alignment to behavioral and neural measurements in the domain of language ☆37 · Apr 2, 2026 · Updated 2 weeks ago
- Agent Memory Playground: AI Agent Memory Design & Optimization Techniques ☆35 · Aug 7, 2025 · Updated 8 months ago
- Code to train on MNIST, CIFAR-10 and ImageNet using burstprop. ☆39 · Jun 7, 2021 · Updated 4 years ago
- ☆13 · Dec 4, 2024 · Updated last year
- ☆13 · Apr 3, 2025 · Updated last year
- [ICML2025] Test-Time Learning for Large Language Models ☆51 · Jan 31, 2026 · Updated 2 months ago
- "Learning Stable Classifiers by Transferring Unstable Features" ICML 2022 ☆14 · Jul 24, 2022 · Updated 3 years ago
- Official Code Repository for the paper "Key-value memory in the brain" ☆31 · Feb 25, 2025 · Updated last year
- ☆21 · Feb 13, 2026 · Updated 2 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts ☆35 · Jul 2, 2024 · Updated last year
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models ☆22 · Mar 25, 2026 · Updated 3 weeks ago
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- ☆14 · Oct 30, 2024 · Updated last year
- Titans - Learning to Memorize at Test Time ☆64 · Jan 16, 2025 · Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 · Nov 3, 2023 · Updated 2 years ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals". ☆17 · Dec 15, 2021 · Updated 4 years ago
- HGRN2: Gated Linear RNNs with State Expansion ☆57 · Aug 20, 2024 · Updated last year
- ☆11 · Oct 11, 2023 · Updated 2 years ago
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch ☆1,948 · Feb 9, 2026 · Updated 2 months ago
- ☆41 · Oct 16, 2024 · Updated last year
- ☆19 · Feb 18, 2025 · Updated last year
- 🍔 Chen’s Private Cuisine Menu ☆10 · Jan 4, 2026 · Updated 3 months ago
- Official PyTorch implementation of CD-MOE ☆12 · Mar 18, 2026 · Updated 3 weeks ago
- ☆109 · Mar 12, 2024 · Updated 2 years ago
- ☆53 · Jul 18, 2024 · Updated last year
- The repository for HKU ENGG1340 Group Project (24/25 Semester 2). ☆10 · Jun 22, 2025 · Updated 9 months ago
- The repository for the paper "A Visual Analytics Framework for Explaining and Diagnosing the Transfer Learning Processes". ☆13 · Dec 21, 2020 · Updated 5 years ago
- A terminal text editor written in MoonBit ☆11 · Apr 7, 2025 · Updated last year
- single-GPU to multi-GPU training of PyTorch apps at NERSC ☆22 · Apr 10, 2024 · Updated 2 years ago
- Open, hand-typed notes by HKU students, for HKU students. ☆18 · Sep 5, 2025 · Updated 7 months ago
- ☆11 · Mar 20, 2025 · Updated last year
- A formalization of synthetic differential geometry in Coq using infinitesimal analysis ☆11 · Aug 29, 2021 · Updated 4 years ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆19 · Mar 10, 2025 · Updated last year
- An End-to-end Transformer for Alzheimer's Disease Detection ☆22 · Aug 5, 2025 · Updated 8 months ago
- ☆18 · Mar 25, 2021 · Updated 5 years ago
- A training program for freshmen ☆14 · Jul 29, 2021 · Updated 4 years ago