thu-ml / 2by4-pretrain-acc-examples
Code for "Accelerating Transformer Pre-training with 2:4 Sparsity"
☆27Dec 8, 2024Updated last year
Alternatives and similar repositories for 2by4-pretrain-acc-examples
Users who are interested in 2by4-pretrain-acc-examples compare it to the libraries listed below
- Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025)☆18Jul 1, 2025Updated 7 months ago
- ☆11Dec 26, 2025Updated last month
- ☆21Nov 12, 2025Updated 3 months ago
- ☆244Nov 9, 2022Updated 3 years ago
- A simple OI (Olympiad in Informatics) problem-submission server☆11Dec 12, 2025Updated 2 months ago
- Validation of syncmers compared to minimizers☆11May 10, 2025Updated 9 months ago
- ☆12Mar 1, 2025Updated 11 months ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆13Jun 28, 2025Updated 7 months ago
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆18Aug 5, 2022Updated 3 years ago
- An implementation of a maximum clique enumeration algorithm☆11Apr 14, 2016Updated 9 years ago
- RabbitKSSD: accelerating genome distance estimation on modern multi-core architectures☆12Jun 5, 2024Updated last year
- Huawei collective communication performance tests☆15May 27, 2024Updated last year
- FPGA 2025 SAT Accel: A modern SAT Solver on FPGA Repository☆14Mar 13, 2025Updated 11 months ago
- 🎓 Automatically update circult-eda-mlsys-tinyml papers daily using GitHub Actions (updates every 8 hours)☆10Updated this week
- Open source Photonics PDK for VTT's 3 um SOI platform.☆14May 26, 2025Updated 8 months ago
- Synthetic aperture focusing technique for optoacoustic mesoscopy and scanning acoustic microscopy.☆13Jul 24, 2024Updated last year
- A fast variant-calling tool to detect germline and somatic variants☆11Jan 31, 2026Updated 2 weeks ago
- Benchmark and resources for single-image super-resolution algorithms☆10Apr 14, 2017Updated 8 years ago
- Implementations of different neural network pruning techniques☆14Aug 10, 2023Updated 2 years ago
- Securing Deep Spiking Neural Networks against Adversarial Attacks through Inherent Structural Parameters☆13Aug 15, 2022Updated 3 years ago
- Repository for AI model benchmarking on TT-Buda☆15Updated this week
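- A collection of practical computer-related technical books, simple hands-on tutorials for getting started quickly, some technical websites, and some well-written blog posts. Forks are welcome, and you can also contribute edits via Pull Request.☆10Jul 21, 2016Updated 9 years ago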
- Unsupervised anomaly detection in the latent space of high energy physics events with quantum machine learning.☆20Oct 29, 2024Updated last year
- DartMinHash: Fast Sketching for Weighted Sets☆12Dec 8, 2025Updated 2 months ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- Google DeepMind: Mixture of Depths Unofficial Implementation.☆12May 29, 2024Updated last year
- ☆12Nov 24, 2023Updated 2 years ago
- Batch Multi-Fidelity Bayesian Optimization with Deep Auto-Regressive Networks☆12Nov 3, 2021Updated 4 years ago
- ☆16Updated this week
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆12Apr 17, 2023Updated 2 years ago
- CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]☆18Mar 3, 2025Updated 11 months ago
- [ICLR25] STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs☆18Jun 3, 2025Updated 8 months ago
- ☆13Aug 1, 2024Updated last year
- ☆15Dec 8, 2022Updated 3 years ago
- A PyTorch Implementation of YOLOv3☆13Apr 16, 2019Updated 6 years ago
- Machine Learning-Enabled Compact Photonic Tensor Core based on Programmable Multi-Operand Multimode Interference☆13Sep 23, 2024Updated last year
- MEEP FPGA Shell project, currently supporting Alveo U280 and U55C☆14Mar 14, 2024Updated last year
- ☆13May 27, 2020Updated 5 years ago