yc2367/BBS-MICRO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yc2367/BBS-MICRO)

yc2367 / BBS-MICRO

☆19

Alternatives and similar repositories for BBS-MICRO

Users that are interested in BBS-MICRO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CLab-HKUST-GZ / micro58-axcore
View on GitHub
☆41Oct 21, 2025Updated 9 months ago
dubcyfor3 / Focus
View on GitHub
[HPCA 2026 Best Paper Candidate] Official implementation of "Focus: A Streaming Concentration Architecture for Efficient Vision-Language …
☆59Feb 8, 2026Updated 5 months ago
abdelfattah-lab / BitMoD-HPCA-25
View on GitHub
☆157Jul 19, 2025Updated last year
ZongwuWang / MILLION
View on GitHub
This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Qu…
☆25Apr 2, 2025Updated last year
abhibambhaniya / progressive_gradient_flow_nm_sparsity
View on GitHub
Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".
☆11Feb 5, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Zhu-ZiXuan / Bitlet-PE
View on GitHub
A bit-level sparsity-awared multiply-accumulate process element.
☆19Jul 9, 2024Updated 2 years ago
kelvin0207 / SparSynergy
View on GitHub
Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib…
☆26Mar 29, 2025Updated last year
ZhanqiuHu / flash-dlm-experimental
View on GitHub
Implementation of Flash-DLM (paper: FlashDLM: Accelerating Diffusion Language Models via Efficient KV Caching and Guided Diffusion). Prov…
☆24Nov 25, 2025Updated 8 months ago
abdelfattah-lab / NVFP4-RaZeR
View on GitHub
☆34Jun 17, 2026Updated last month
tsinghua-ideal / spada-sim
View on GitHub
The simulator for SPADA, an SpGEMM accelerator with adaptive dataflow
☆47Jan 26, 2023Updated 3 years ago
snu-comparch / Tender
View on GitHub
Tender: Accelerating Large Language Models via Tensor Decompostion and Runtime Requantization (ISCA'24)
☆34Jul 4, 2024Updated 2 years ago
CASR-HKU / DPACS
View on GitHub
☆19Mar 21, 2023Updated 3 years ago
SJTU-ReArch-Group / M2XFP_ASPLOS26
View on GitHub
[ASPLOS 2026] M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization.
☆15Jan 29, 2026Updated 5 months ago
georgia-tech-synergy-lab / MicroScopiQ-LLM-Quantization
View on GitHub
[ISCA 2025] Official Implementation of "MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling Quantization"
☆24Oct 30, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Equationliu / Kangaroo
View on GitHub
[NeurIPS 2024] The official implementation of "Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exitin…
☆72Jun 26, 2024Updated 2 years ago
XBQ314 / A-Number-Theoretic-Transform-Accelerator-with-Two-Parallel-Simplified-Butterfly-Units
View on GitHub
Implementation of Number-theoretic transform(NTT) algorithm on FPGA; 快速数论变换(NTT)的FPGA实现，基为2，有两个并行的蝶形单元
☆19Sep 16, 2022Updated 3 years ago
clevercool / ANT-Quantization
View on GitHub
☆123Nov 17, 2023Updated 2 years ago
parsa-epfl / quantization-sparsity-interplay
View on GitHub
This repo contains the code for studying the interplay between quantization and sparsity methods
☆26Feb 26, 2025Updated last year
jha-lab / acceltran
View on GitHub
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers
☆62Nov 22, 2023Updated 2 years ago
CASR-HKU / MSD-FCCM23
View on GitHub
Open-source of MSD framework
☆16Sep 12, 2023Updated 2 years ago
IPADS-SAI / WaferAI-SIM
View on GitHub
The wafer-native AI accelerator simulation platform and inference engine.
☆57Jan 1, 2026Updated 6 months ago
SET-Scheduling-Project / GEMINI-HPCA2024
View on GitHub
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
☆116Apr 28, 2025Updated last year
imagination-research / EEP
View on GitHub
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
☆25Nov 11, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Fibertree-Project / fibertree
View on GitHub
Fibertree emulator
☆17Nov 4, 2024Updated last year
tsinghua-fib-lab / HENCE
View on GitHub
The official implementation of AAAI 2024 paper: Estimating On-road Transportation Carbon Emissions from Open Data of Road Network and Ori…
☆12Feb 24, 2024Updated 2 years ago
upenn-acg / RPG2-public
View on GitHub
RPG^2 is a pure-software system that operates on running C/C++ programs, profiling them, injecting prefetch instructions, and then tuning…
☆14May 15, 2024Updated 2 years ago
horizon-research / Efficient-Deep-Learning-for-Point-Clouds
View on GitHub
☆49Apr 22, 2021Updated 5 years ago
AICrossSim / PLENA_Simulator
View on GitHub
☆22Updated this week
CASR-HKU / AGNA-FCCM2023
View on GitHub
☆12Nov 24, 2023Updated 2 years ago
georgia-tech-synergy-lab / SIGMA
View on GitHub
RTL implementation of Flex-DPE.
☆117Feb 22, 2020Updated 6 years ago
CAS-CLab / BlockConv
View on GitHub
[TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA
☆17Jul 7, 2022Updated 4 years ago
Michaela1224 / SDA_code
View on GitHub
SDA: Low-Bit Stable Diffusion Acceleration on Edge FPGAs
☆19May 23, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ChenMnZ / INT_vs_FP
View on GitHub
[ICML 2026]A framework to compare low-bit integer and float-point formats
☆81May 6, 2026Updated 2 months ago
pku-lemonade / TokenSim
View on GitHub
TokenSim is a tool for simulating the behavior of large language models (LLMs) in a distributed environment.
☆27Jun 26, 2026Updated last month
maeri-project / FEATHER
View on GitHub
A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching
☆91Apr 26, 2026Updated 3 months ago
SamsungLabs / Butterfly_Acc
View on GitHub
☆23Jun 25, 2025Updated last year
SFU-HiAccel / HiSpMV
View on GitHub
[TRETS 2025][FPGA 2024] FPGA Accelerator for Imbalanced SpMV using HLS
☆23Aug 24, 2025Updated 11 months ago
harvard-acc / EdgeBERT
View on GitHub
HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference
☆54Mar 24, 2024Updated 2 years ago
KULeuven-MICAS / zigzag-llm
View on GitHub
Model LLM inference on single-core dataflow accelerators
☆19Dec 16, 2025Updated 7 months ago