Artifact repository for the paper "Automatic Generation of High-Performance Quantized Machine Learning Kernels"
☆17 · Oct 13, 2020 · Updated 5 years ago
Alternatives and similar repositories for cgo-artifact-2020
Users interested in cgo-artifact-2020 are comparing it to the libraries listed below
- ☆23 · Oct 7, 2025 · Updated 5 months ago
- egraph <-> json ☆16 · Dec 29, 2025 · Updated 2 months ago
- Floating point modules for CHISEL ☆32 · Nov 2, 2014 · Updated 11 years ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24) ☆56 · May 29, 2024 · Updated last year
- firrtlator is a FIRRTL C++ library ☆23 · Dec 15, 2016 · Updated 9 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details ☆25 · Sep 27, 2019 · Updated 6 years ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021 ☆56 · Jul 21, 2021 · Updated 4 years ago
- ☆36 · Mar 29, 2023 · Updated 2 years ago
- Using e-graphs for logic synthesis (ICCAD'25) ☆33 · Mar 12, 2026 · Updated last week
- The Next-gen Language & Compiler Powering Efficient Hardware Design ☆36 · Jan 16, 2025 · Updated last year
- GPU Performance Advisor ☆66 · Jul 25, 2022 · Updated 3 years ago
- A repository containing homework labs for CSE548 ☆42 · Jun 8, 2017 · Updated 8 years ago
- Horizontal Fusion ☆24 · Jan 7, 2022 · Updated 4 years ago
- The code for Joint Neural Architecture Search and Quantization ☆14 · Apr 10, 2019 · Updated 6 years ago
- A tool for model sparse based on torch.fx ☆13 · Jun 3, 2024 · Updated last year
- Reticle evaluation (PLDI 2021) ☆12 · Apr 12, 2021 · Updated 4 years ago
- A High-performance Timing Analysis Tool for VLSI Systems ☆10 · Feb 11, 2021 · Updated 5 years ago
- ☆12 · Jan 6, 2023 · Updated 3 years ago
- RedEye is a vision sensor designed to execute early stages of a deep convolutional neural network (ConvNet) in the analog domain. This re… ☆14 · Dec 16, 2016 · Updated 9 years ago
- Efficient GPU kernels for mixed-precision Vision Transformers in Triton ☆18 · Sep 18, 2025 · Updated 6 months ago
- Create cross repository milestones in Github ☆10 · Nov 10, 2025 · Updated 4 months ago
- ☆10 · Dec 18, 2023 · Updated 2 years ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads ☆47 · Apr 7, 2021 · Updated 4 years ago
- Your AI-Powered Debugging Companion 🤖 ☆11 · Dec 16, 2023 · Updated 2 years ago
- SQL Optimizations using MLIR ☆12 · Apr 5, 2020 · Updated 5 years ago
- PYNQ with Chisel and Rust ☆26 · Jan 2, 2018 · Updated 8 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory ☆137 · Feb 21, 2022 · Updated 4 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf) ☆31 · Oct 24, 2018 · Updated 7 years ago
- Formal semantics of BSV (Bluespec SystemVerilog), given as a Haskell Program and accompanying document ☆18 · Jul 17, 2016 · Updated 9 years ago
- [ECCV 2024] CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs ☆18 · Jul 2, 2024 · Updated last year
- Convert ANY IR to ONNX format ☆26 · Feb 12, 2026 · Updated last month
- Extract your SlidesLive presentation. ☆15 · Apr 19, 2024 · Updated last year
- Net prototxt generator for Caffe ☆13 · Aug 25, 2016 · Updated 9 years ago
- PyTorch Quantization Framework For OCP MX Datatypes. ☆16 · May 30, 2025 · Updated 9 months ago
- ☆11 · Jan 21, 2019 · Updated 7 years ago
- Simple intermediate representation language for learning and research. ☆20 · Mar 27, 2020 · Updated 5 years ago
- Using e-graphs to synthesize netlists from boolean logic. ☆14 · Jul 26, 2023 · Updated 2 years ago
- Perceptron-based branch predictor written in C++ ☆13 · Dec 14, 2016 · Updated 9 years ago
- Keras implementation of YOLOv2 refer to Andrew Ng ☆11 · Feb 14, 2018 · Updated 8 years ago