[NeurIPS 2024] BLAST: Block Level Adaptive Structured Matrix for Efficient Deep Neural Network Inference
☆18Nov 6, 2024Updated last year
Alternatives and similar repositories for BLAST
Users that are interested in BLAST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Attention-based end-to-end ASR on TIMIT in PyTorch☆18Nov 9, 2021Updated 4 years ago
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated 2 years ago
- New RowHammer mitigation mechanism that is area-, performance-, and energy-efficient especially at very low (e.g., 125) RowHammer thresho…☆17May 2, 2024Updated 2 years ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆20Jun 22, 2023Updated 3 years ago
- This repository contains the python scripts developed as a part of the work presented in the paper "Low-latency auditory spatial attentio…☆10Sep 15, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Systolic-array based Deep Learning Accelerator generator☆29Dec 11, 2020Updated 5 years ago
- This repository contains the python scripts developed as a part of the work presented in the paper "STAnet: A Spatiotemporal Attention Ne…☆15May 10, 2023Updated 3 years ago
- 32-bit RISC-V based processor with memory controler☆16Sep 2, 2022Updated 3 years ago
- Eyeriss Hardware Accelerator for Machine Learning☆13May 29, 2022Updated 4 years ago
- ☆17Jun 11, 2025Updated last year
- FPGA implement of 8x8 weight stationary systolic array DNN accelerator☆18Feb 27, 2021Updated 5 years ago
- Stochastic Cellular Automata epidemic models in Python with 2D simulations☆15Feb 24, 2020Updated 6 years ago
- Performs a faster tensor train (TT) decomposition for large sparse data☆14Sep 7, 2020Updated 5 years ago
- Implementation for the paper 'Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport' (ICL…☆20Jan 1, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Tensorflow Implementation of Paper "Improved Training of Wasserstein GANs"☆24Apr 20, 2018Updated 8 years ago
- ☆20Sep 28, 2024Updated last year
- ☆19Mar 13, 2023Updated 3 years ago
- Verilog implementation of different concepts in Digital Logic Design such as OTHFSM, AFG and Accelerators☆11Dec 26, 2023Updated 2 years ago
- Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.☆13Jun 5, 2024Updated 2 years ago
- ☆17May 18, 2023Updated 3 years ago
- Torch MinGRU implementation based on "Were RNNs All We Needed?"☆23Dec 5, 2024Updated last year
- PyTorch implementation of EEGDfus☆29Oct 9, 2025Updated 8 months ago
- Implementation of Sparse Regression Codes (SPARCs)/Sparse Superposition Codes for communications over the AWGN channel.☆14Nov 23, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Codes for Spiking Neural Networks with Improved Inherent Recurrence Dynamics for Sequential Learning☆11May 5, 2022Updated 4 years ago
- The code for the Subformer, from the EMNLP 2021 Findings paper: "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generati…☆16Sep 1, 2021Updated 4 years ago
- ☆16Oct 5, 2022Updated 3 years ago
- pytorch fixed point training tool/framework☆34Oct 14, 2020Updated 5 years ago
- ☆31Jun 8, 2022Updated 4 years ago
- Pytorch Implementation of Spiking Neural Networks Calibration, ICML 2021☆92May 9, 2024Updated 2 years ago
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆40Apr 5, 2024Updated 2 years ago
- A C++ implementation of a 3 layer Gated Recurrent Unit (GRU) using no libraries other than Eigen for Matrices.☆23Jan 28, 2020Updated 6 years ago
- MATLAB/Octave generator of Hamming ECC coding. Output format is Verilog HDL.☆12Dec 27, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This project is the code of BF-GCN. The paper has been accepted by IEEE Transactions on Neural Networks and Learning Systems.☆24Jul 2, 2024Updated 2 years ago
- ☆34Oct 4, 2024Updated last year
- Official implementation for the IJCAI'24 paper: SDformer☆34Mar 6, 2025Updated last year
- Modelling the insect navigation toolkit☆12Mar 15, 2024Updated 2 years ago
- FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing [NeurIPS 2024]☆32Aug 12, 2025Updated 10 months ago
- CasHMC: A Cycle-accurate Simulator for Hybrid Memory Cube☆24Aug 10, 2018Updated 7 years ago
- Neural Network Acceleration such as ASIC, FPGA, GPU, and PIM☆54Apr 13, 2020Updated 6 years ago