The simplest, fastest repository for training/finetuning medium-sized xLSTMs.
☆41May 24, 2024Updated 2 years ago
Alternatives and similar repositories for nanoXLSTM
Users who are interested in nanoXLSTM are comparing it to the libraries listed below.
- Combining SOAP and MUON☆22Feb 11, 2025Updated last year
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- ojjson is a library designed to facilitate JSON interactions with Ollama, a large language model (LLM) API. It leverages the power of Zod for s…☆12Nov 7, 2024Updated last year
- ☆50May 13, 2024Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.☆16Sep 13, 2025Updated 8 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆20Mar 9, 2025Updated last year
- ☆27Mar 13, 2024Updated 2 years ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated 2 years ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 3 years ago
- Luber : A ridesharing App☆14Dec 13, 2017Updated 8 years ago
- 💥 Make peer-2-peer global works☆52Jan 29, 2026Updated 3 months ago
- Implementation of Danijar's latest iteration for his Dreamer line of work☆185May 18, 2026Updated last week
- Rust derive macros for automating the boring stuff.☆14Aug 3, 2025Updated 9 months ago
- ☆12Aug 6, 2020Updated 5 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 10 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆10Jan 12, 2021Updated 5 years ago
- Neurosity EEG Dataset repository☆29Apr 8, 2024Updated 2 years ago
- Bleeding edge low level Rust binding for GGML☆17Jun 26, 2024Updated last year
- ☆15Oct 31, 2023Updated 2 years ago
- Developing, training, and assessing the performance of a Proximal Policy Optimization (PPO) Stock Trading Agent.☆14Aug 20, 2025Updated 9 months ago
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆18Dec 8, 2025Updated 5 months ago
- Almost SOTA LLM architecture, with O(n) time complexity☆11Jan 19, 2025Updated last year
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Modified Mamba code to run on CPU☆32Jan 14, 2024Updated 2 years ago
- ☆27Jul 9, 2024Updated last year
- A few models converted from Caffe to Core ML format.☆15Jun 6, 2017Updated 8 years ago
- ☆12Sep 18, 2024Updated last year
- A converter and basic tester for rwkv onnx☆44Jan 29, 2024Updated 2 years ago
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆113Oct 15, 2024Updated last year
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆31Updated this week
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- LLM-powered lossless compression tool☆311Jan 2, 2026Updated 4 months ago
- ☆16Apr 20, 2026Updated last month
- Simple Streamlit UI for Ollama☆22May 13, 2024Updated 2 years ago
- Simple UI cli LLaMA Model Finetuning☆10Mar 23, 2023Updated 3 years ago