Trying out the Mamba architecture on small examples (cifar-10, shakespeare char level etc.)
☆47Dec 12, 2023Updated 2 years ago
Alternatives and similar repositories for mamba_small_bench
Users that are interested in mamba_small_bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Sep 1, 2023Updated 2 years ago
- Understand what physics/algorithms do transformers learn internally when trained on planetary motion☆43Feb 9, 2026Updated 4 months ago
- 🏃♀️🏃♂️ ⏳ A Julia wrapper for wasmtime☆13Oct 3, 2023Updated 2 years ago
- Fast, gpu-accelerated distance transforms☆16Jun 28, 2026Updated last week
- SymPy with PythonCall backend (not PyCall)☆12Feb 19, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- it's just bytes☆13Apr 16, 2025Updated last year
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,960Mar 8, 2024Updated 2 years ago
- ProToPortal: The Portal to the Magic of PromptingTools and Julia-first LLM Coding☆17Jul 14, 2024Updated last year
- Software phantoms for image reconstruction☆16Jun 23, 2026Updated last week
- ☆107Mar 9, 2024Updated 2 years ago
- Griffin MQA + Hawk Linear RNN Hybrid☆89Apr 13, 2026Updated 2 months ago
- Flux reconstruction fluid flow solver for 1D PDEs written in Julia. Linear advection, Burgers, viscous Burgers, and Euler equations.☆14Apr 28, 2022Updated 4 years ago
- ☆15Mar 15, 2022Updated 4 years ago
- ☆28Jun 9, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12Jun 5, 2024Updated 2 years ago
- A Haskell library for building incremental static site generators☆14Nov 30, 2023Updated 2 years ago
- Fast Kolmogorov-Arnold Network in JAX, initial experiments☆16May 20, 2024Updated 2 years ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆19Jun 5, 2023Updated 3 years ago
- ☆14Jul 25, 2023Updated 2 years ago
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…☆10Aug 28, 2018Updated 7 years ago
- Simple, low-level views into memory in Julia☆24Apr 12, 2026Updated 2 months ago
- Representing machine learning models using mathematical programming☆19Aug 21, 2024Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Cross Atlas Remapping via Optimal Transport☆12Dec 14, 2023Updated 2 years ago
- Autosuggestions for function keywords☆20Mar 23, 2026Updated 3 months ago
- ☆15Oct 31, 2023Updated 2 years ago
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated this week
- A MATLAB function library containing encoders, decoders and weight enumerators for Reed-Muller codes.☆13Aug 19, 2023Updated 2 years ago
- Code for paper 'Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction' (TOMM 2023)☆10Sep 6, 2025Updated 9 months ago
- ☆23May 29, 2022Updated 4 years ago
- Pre-computed IDF stats over all EN Wiki articles☆13Jan 30, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Audio-only Emotion Detection using Federated Learning☆10Dec 8, 2022Updated 3 years ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- ☆24Jan 21, 2024Updated 2 years ago
- ☆13Mar 9, 2024Updated 2 years ago
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated last year
- ☆19Mar 4, 2025Updated last year
- Causal Reasoning for Membership Inference Attacks☆11Oct 21, 2022Updated 3 years ago