Adaptation of titans-pytorch to llama models on HF
☆25Mar 6, 2025Updated last year
Alternatives and similar repositories for llama-titans
Users that are interested in llama-titans are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repo of paper LM2☆47Feb 13, 2025Updated last year
- ☆13Apr 15, 2024Updated last year
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- Agent Memory Playground: AI Agent Memory Design & Optimization Techniques☆33Aug 7, 2025Updated 7 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Integrates Imbue's Cost Aware pareto-Region Bayesian Search (CARBS) with Weights and Biases (WanDB)☆12Mar 17, 2025Updated last year
- [ICML2025] Test-Time Learning for Large Language Models☆47Jan 31, 2026Updated last month
- "Learning Stable Classifiers by Transferring Unstable Features" ICML 2022☆14Jul 24, 2022Updated 3 years ago
- Official Code Repository for the paper "Key-value memory in the brain"☆31Feb 25, 2025Updated last year
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models☆22Apr 16, 2025Updated 11 months ago
- ☆30May 21, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆13May 9, 2024Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official pytorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)☆16Nov 22, 2022Updated 3 years ago
- Titans - Learning to Memorize at Test Time☆63Jan 16, 2025Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- HGRN2: Gated Linear RNNs with State Expansion☆56Aug 20, 2024Updated last year
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆17Dec 15, 2021Updated 4 years ago
- Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments (VIS 2024)☆28Jul 17, 2025Updated 8 months ago
- ☆11Oct 11, 2023Updated 2 years ago
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch☆1,935Feb 9, 2026Updated last month
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The repository for HKU ENGG1340 Group Project (24/25 Semester 2).☆10Jun 22, 2025Updated 9 months ago
- ☆25Aug 19, 2024Updated last year
- The repository for the paper "A Visual Analytics Framework for Explaining and Diagnosing the Transfer Learning Processes".☆13Dec 21, 2020Updated 5 years ago
- ☆13Aug 4, 2022Updated 3 years ago
- Centralized cooperative reinforcement learning☆13Jan 8, 2023Updated 3 years ago
- Open, hand-typed notes by HKU students, for HKU students.☆18Sep 5, 2025Updated 6 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- A neural network library written in jax☆13Feb 3, 2025Updated last year
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆21Jan 8, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…☆38Oct 24, 2025Updated 5 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- This repositorie es the code of the paper Optimizing Reusable Knowledge for Continual Learning via Metalearning.☆11Oct 12, 2021Updated 4 years ago
- Accompanying repo for the paper - High-speed Autonomous Racing using Trajectory-aided Deep Reinforcement Learning☆18Jan 17, 2024Updated 2 years ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 3 years ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated last year