Adaptation of titans-pytorch to llama models on HF
☆24Mar 6, 2025Updated last year
Alternatives and similar repositories for llama-titans
Users that are interested in llama-titans are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repo of paper LM2☆48Feb 13, 2025Updated last year
- ☆13Apr 15, 2024Updated 2 years ago
- ☆20Mar 11, 2025Updated last year
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- Evaluate computational models on their alignment to behavioral and neural measurements in the domain of language☆39Jun 8, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICML'2025] From Token to Rhythm: A Multi-Scale Approach for ECG-Language Pretraining☆29Mar 9, 2026Updated 3 months ago
- Code to train on MNIST, CIFAR-10 and ImageNet using burstprop.☆39Jun 7, 2021Updated 5 years ago
- Code repository for the IEEE VIS 2023 paper "A Comparative Visual Analytics Framework for Evaluating Evolutionary Processes in Multi-obje…☆13Jan 30, 2024Updated 2 years ago
- ☆14Apr 3, 2025Updated last year
- "Learning Stable Classifiers by Transferring Unstable Features" ICML 2022☆14Jul 24, 2022Updated 3 years ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Jul 2, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆14May 9, 2024Updated 2 years ago
- [ICML2025] Test-Time Learning for Large Language Models☆54Jan 31, 2026Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆34May 21, 2024Updated 2 years ago
- [ 👾 ] ➡️ 💾 ➡️ { 🎮🕹️ } Extra Stable-Baselines3 buffer classes. Reducing RL memory usage drastically with minimal overhead.☆23Updated this week
- 🧪 Experiments in calling Zig code from MoonBit (via C ABI bridge initially), aiming for direct interop.☆11Apr 7, 2025Updated last year
- ☆14Oct 30, 2024Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 9 months ago
- Official pytorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)☆16Nov 22, 2022Updated 3 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆17Dec 15, 2021Updated 4 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆44Oct 16, 2024Updated last year
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch☆1,963Jun 6, 2026Updated last week
- 🍔 Chen’s Private Cuisine Menu☆10Jan 4, 2026Updated 5 months ago
- ☆54Jul 18, 2024Updated last year
- ☆113Mar 12, 2024Updated 2 years ago
- A terminal text editor written in MoonBit☆10Apr 7, 2025Updated last year
- Centralized cooperative reinforcement learning☆13Jan 8, 2023Updated 3 years ago
- Open, hand-typed notes by HKU students, for HKU students.☆18Sep 5, 2025Updated 9 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Mar 20, 2025Updated last year
- A formalization of synthetic differential geometry in Coq using infinitesimal analysis☆11Aug 29, 2021Updated 4 years ago
- A neural network library written in jax☆13Feb 3, 2025Updated last year
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆22Jan 8, 2025Updated last year
- Multi-agent Reinforcement Learning game using Advantage Actor Critic (A2C) algorithm☆14Sep 26, 2023Updated 2 years ago
- Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…☆39Oct 24, 2025Updated 7 months ago
- Sample data associated with the Aurora-BP study☆41Mar 18, 2026Updated 2 months ago