The simplest, fastest repository for training/finetuning medium-sized xLSTMs.
☆41May 24, 2024Updated 2 years ago
Alternatives and similar repositories for nanoXLSTM
Users that are interested in nanoXLSTM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated last year
- ojjson is a library designed to facilitate JSON interactions with Ollama, a large language api (LLM). It leverages the power of Zod for s…☆12Nov 7, 2024Updated last year
- ☆50May 13, 2024Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- ☆27Mar 13, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes☆14Jul 1, 2025Updated last year
- Official repository for the paper "Automating Continual Learning"☆20Jun 11, 2025Updated last year
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- Awesome list of papers that extend Mamba to various applications.☆141Jun 4, 2026Updated last month
- Spotlight-like client for Ollama on Windows.☆28May 18, 2024Updated 2 years ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 3 years ago
- 💥 Make peer-2-peer global works☆55Jan 29, 2026Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Temporal Neural Networks☆30Mar 2, 2026Updated 4 months ago
- Rust derive macros for automating the boring stuff.☆14Aug 3, 2025Updated 11 months ago
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆49Sep 2, 2025Updated 10 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆16Apr 30, 2025Updated last year
- Implementation of Neurips 2023 Paper "Multi Time Scale World Models"☆17Nov 8, 2024Updated last year
- Reasoning-based Evaluation and Ranking of Translations.☆19Jun 2, 2026Updated last month
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆11Jan 12, 2021Updated 5 years ago
- Framework for Self-Organizing Python Agents☆29Feb 4, 2024Updated 2 years ago
- Developing, training, and assessing the performance of a Proximal Policy Optimization (PPO) Stock Trading Agent.☆14Aug 20, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Lightweight Deep Learning Library in MATLAB☆11Jun 28, 2019Updated 7 years ago
- ☆23Nov 6, 2022Updated 3 years ago
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆18Jun 28, 2026Updated last week
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Modified Mamba code to run on CPU☆34Jan 14, 2024Updated 2 years ago
- ☆27Jul 9, 2024Updated last year
- Repository for Sparse Universal Transformers☆20Oct 23, 2023Updated 2 years ago
- ☆13Sep 18, 2024Updated last year
- A guide to structured generation using constrained decoding☆18Jun 9, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 4 years ago
- Parsing and serialization support for PSSH boxes used in DRM systems☆16Jun 27, 2026Updated last week
- A node utility to scan a domain with various techniques.☆12Sep 10, 2020Updated 5 years ago
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆114Oct 15, 2024Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆31Jun 22, 2026Updated last week
- LLM-powered lossless compression tool☆314Jun 16, 2026Updated 2 weeks ago
- ☆14Updated this week