Adaptation of titans-pytorch to llama models on HF
☆25 · Mar 6, 2025 · Updated last year
Alternatives and similar repositories for llama-titans
Users that are interested in llama-titans are comparing it to the libraries listed below.
- Official repo of paper LM2 ☆48 · Feb 13, 2025 · Updated last year
- ☆13 · Apr 15, 2024 · Updated 2 years ago
- ☆20 · Mar 11, 2025 · Updated last year
- AI Based "Happiness Optimizer" ☆12 · Oct 20, 2024 · Updated last year
- Efficient PScan implementation in PyTorch ☆17 · Jan 2, 2024 · Updated 2 years ago
- Integrates Imbue's Cost Aware pareto-Region Bayesian Search (CARBS) with Weights and Biases (WanDB) ☆12 · Mar 17, 2025 · Updated last year
- Official Code Repository for the paper "Key-value memory in the brain" ☆32 · Feb 25, 2025 · Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts ☆35 · Jul 2, 2024 · Updated last year
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models ☆23 · Mar 25, 2026 · Updated 2 months ago
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- [ICML2025] Test-Time Learning for Large Language Models ☆55 · Jan 31, 2026 · Updated 3 months ago
- [ 👾 ] ➡️ 💾 ➡️ { 🎮🕹️ } Extra Stable-Baselines3 buffer classes. Reducing RL memory usage drastically with minimal overhead. ☆23 · Dec 9, 2025 · Updated 5 months ago
- ☆14 · Oct 30, 2024 · Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3 ☆11 · Sep 14, 2025 · Updated 8 months ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 · Nov 3, 2023 · Updated 2 years ago
- Titans - Learning to Memorize at Test Time ☆67 · Jan 16, 2025 · Updated last year
- ☆43 · Oct 16, 2024 · Updated last year
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch ☆1,959 · Feb 9, 2026 · Updated 3 months ago
- 🍔 Chen’s Private Cuisine Menu ☆10 · Jan 4, 2026 · Updated 4 months ago
- Official PyTorch implementation of CD-MOE ☆12 · Mar 18, 2026 · Updated 2 months ago
- ☆54 · Jul 18, 2024 · Updated last year
- ☆25 · Aug 19, 2024 · Updated last year
- ☆112 · Mar 12, 2024 · Updated 2 years ago
- A terminal text editor written in MoonBit ☆11 · Apr 7, 2025 · Updated last year
- ☆13 · Aug 4, 2022 · Updated 3 years ago
- Open, hand-typed notes by HKU students, for HKU students. ☆18 · Sep 5, 2025 · Updated 8 months ago
- ☆11 · Mar 20, 2025 · Updated last year
- A neural network library written in jax ☆13 · Feb 3, 2025 · Updated last year
- Sample data associated with the Aurora-BP study ☆40 · Mar 18, 2026 · Updated 2 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 · Aug 30, 2024 · Updated last year
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) ☆19 · Feb 11, 2025 · Updated last year
- Repository for Skill Set Optimization ☆14 · Jul 26, 2024 · Updated last year
- This repository is the code for the paper "Optimizing Reusable Knowledge for Continual Learning via Metalearning". ☆11 · Oct 12, 2021 · Updated 4 years ago
- Accompanying repo for the paper - High-speed Autonomous Racing using Trajectory-aided Deep Reinforcement Learning ☆17 · Jan 17, 2024 · Updated 2 years ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal… ☆56 · Feb 28, 2023 · Updated 3 years ago
- A few dotfiles for my Manjaro/i3 desktop. ☆18 · Jan 16, 2019 · Updated 7 years ago
- Reference implementation of models from Nyonic Model Factory ☆12 · May 13, 2024 · Updated 2 years ago
- Write games using jok (Zig) through MoonBit (Wasm). ☆16 · Mar 24, 2025 · Updated last year
- Write, compile and run WebAssembly text mode in the browser ☆16 · May 16, 2026 · Updated last week