Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.co/NX-AI/xLSTM-7b.
☆105Apr 8, 2026Updated last month
Alternatives and similar repositories for xlstm-jax
Users that are interested in xlstm-jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.☆89Mar 27, 2026Updated last month
- FlashRNN - Fast RNN Kernels with I/O Awareness☆181Oct 20, 2025Updated 7 months ago
- ☆23Nov 23, 2025Updated 5 months ago
- a minimal website to get the diff of llm rewrites☆11Dec 11, 2024Updated last year
- OneNote export using Microsoft Graph API☆20Sep 2, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆53Oct 20, 2025Updated 7 months ago
- Jax/Flax implementation of DeiT and DeiT-III (ViT)☆19Dec 21, 2024Updated last year
- Mamba4Cast, a zero-shot time series forecasting model, achieves competitive performance and faster inference than transformer-based model…☆51Oct 11, 2024Updated last year
- Implementation of our NeurIPS 2019 paper: Subspace Attack: Exploiting Promising Subspaces for Query-Efficient Black-box Attacks☆10Dec 16, 2019Updated 6 years ago
- Optimised Extended LSTM for time-series forecasting☆44May 2, 2026Updated 2 weeks ago
- ☆10Nov 13, 2024Updated last year
- MLX implementation of xLSTM model by Beck et al. (2024)☆31Jun 5, 2024Updated last year
- xLSTM as Generic Vision Backbone☆490Oct 20, 2025Updated 7 months ago
- ☆30Feb 27, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆22Sep 24, 2025Updated 7 months ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 10 months ago
- Code and data for paper "(How) do Language Models Track State?"☆22Mar 31, 2025Updated last year
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- ☆260Jun 6, 2025Updated 11 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 7 months ago
- ☆43Jul 16, 2025Updated 10 months ago
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…☆32Feb 7, 2025Updated last year
- ☆28Jan 8, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13Jan 16, 2025Updated last year
- This is the open-source code for TokenCarve.☆26Jan 23, 2026Updated 3 months ago
- ☆14Oct 21, 2024Updated last year
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆24Feb 10, 2025Updated last year
- ☆36Nov 22, 2024Updated last year
- Implementation for the AAAI '26 paper "T3Time: Tri-Modal Time Series Forecasting via Adaptive Multi-Head Alignment and Residual Fusion"☆54Dec 6, 2025Updated 5 months ago
- Synthetic data generation for bangla OCR☆19Dec 1, 2022Updated 3 years ago
- Official repository for the MMFM challenge☆25Jun 18, 2024Updated last year
- the official code of "Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation" (ECCV2024)☆13Jan 14, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- NumPy+Jax with named axes and an uncompromising attitude☆23Mar 4, 2025Updated last year
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Flow-matching algorithms in JAX☆116Aug 12, 2024Updated last year
- Pseudo-Marginal Slice Sampling☆19Jun 8, 2016Updated 9 years ago
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- official repository for the NeurIPS 2022 paper "Adversarial Attack on Attackers: Post-Process to Mitigate Black-Box Score-Based Query Att…☆20Oct 28, 2022Updated 3 years ago
- Jax SSM Library☆48Nov 24, 2022Updated 3 years ago