☆53Nov 22, 2024Updated last year
Alternatives and similar repositories for mamba2-torch
Users that are interested in mamba2-torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimal Mamba-2 implementation in PyTorch☆253Jun 17, 2024Updated 2 years ago
- PyTorch implementation of models from the Zamba2 series.☆194Jan 23, 2025Updated last year
- Implementation of Microscaling data formats in SystemVerilog.☆33Jul 6, 2025Updated 11 months ago
- the xoroshiro32++ and xoroshiro64++ PRNG algorthims by David Blackman and Sebastiano Vigna in C++, Verilog, VHDL and SpinalHDL.☆16Dec 2, 2018Updated 7 years ago
- The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025]☆70Jun 19, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A PYNQ overlay demonstrating the Xilinx RFSoC SD-FEC☆13Jun 29, 2022Updated 3 years ago
- Program to scan for malicious FPGA designs.☆17Mar 20, 2021Updated 5 years ago
- A bunch of kernels that might make stuff slower 😉☆91Updated this week
- A fork of llama3.c used to do some R&D on inferencing☆23Dec 20, 2024Updated last year
- '내마리'는 나의 이야기에 귀를 기울임으로써 나에게 공감하고, 이야기의 맥락을 파악하고, 더 깊은 내용을 질문해주는 챗봇입니다.☆13Sep 9, 2023Updated 2 years ago
- A Range-Null Space Decomposition Approach for Fast and Flexible Spectral Compressive Imaging☆11May 18, 2023Updated 3 years ago
- Based on the mHC architecture proposed by deepseek, the residual links of the existing iTransformer are replaced and updated to obtain a …☆31Mar 18, 2026Updated 3 months ago
- Perceptron-based branch predictor written in C++☆14Dec 14, 2016Updated 9 years ago
- A minimal WebRTC SFU Implementation☆21Jun 15, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21May 12, 2026Updated last month
- This document adopts the method from the XAPP1230 for doing readback capture on Xilinx UltraScale devices and shows how to migrate the sa…☆18Nov 15, 2019Updated 6 years ago
- Collect papers about Mamba (a selective state space model).☆15Aug 6, 2024Updated last year
- Repository for work on on Xilinx's matrix vector activation unit's RTL implementation. Documentation available at: https://asadalam.githu…☆20Jan 21, 2022Updated 4 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆34Mar 26, 2026Updated 2 months ago
- ☆10Apr 25, 2024Updated 2 years ago
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆15Jun 1, 2023Updated 3 years ago
- MACKO: Sparse matrix vector multiplication for low sparsity☆38Apr 6, 2026Updated 2 months ago
- 100G Udp Link For axi Stream☆16Jun 27, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- codes for RFSR: Improving ISR Diffusion Models via Reward Feedback Learning☆18Dec 8, 2024Updated last year
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 10 years ago
- Apply Graph Neural Networks to Optimize Factor Feature Extraction of FactorVAE☆13Jan 11, 2025Updated last year
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,956Mar 8, 2024Updated 2 years ago
- ☆19Sep 15, 2022Updated 3 years ago
- ☆11Oct 10, 2019Updated 6 years ago
- ☆17Nov 28, 2024Updated last year
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Nov 25, 2023Updated 2 years ago
- Utilizing MATLAB to show how SVD can be used to compress colored images☆10May 28, 2017Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆62Apr 16, 2026Updated 2 months ago
- Smoothing video traffic to make it a friendlier internet neighbor☆14Apr 23, 2024Updated 2 years ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆24Oct 13, 2025Updated 8 months ago
- [CVPR 2023] Metadata-Based RAW Reconstruction via Implicit Neural Functions☆11Jun 11, 2023Updated 3 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆20Sep 22, 2021Updated 4 years ago
- A simple Tensorflow implementation of https://arxiv.org/abs/1906.04985☆13May 16, 2019Updated 7 years ago
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆20Sep 15, 2025Updated 9 months ago