A block oriented training approach for inference time optimization.
☆34Aug 19, 2024Updated last year
Alternatives and similar repositories for superblock
Users that are interested in superblock are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of "Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming" (ICML'23)☆13Jul 11, 2024Updated last year
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration☆20Jun 27, 2025Updated 8 months ago
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …☆50Jun 6, 2025Updated 8 months ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆31Aug 15, 2024Updated last year
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆13Apr 29, 2025Updated 10 months ago
- MobileLLM-R1☆75Sep 30, 2025Updated 5 months ago
- Code for reproducing work of ICML 2019 paper: Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Appli…☆12Jun 8, 2019Updated 6 years ago
- Lustre Repository with MS patches☆13Updated this week
- Python wrappers for the FirecREST API☆12Dec 23, 2025Updated 2 months ago
- MaskedTensors for PyTorch☆38Jul 17, 2022Updated 3 years ago
- Frame-agnostic XAI Library for Computer Vision, for understanding why models behave that way.☆11Feb 19, 2023Updated 3 years ago
- Lustre HSM tools☆10Feb 19, 2024Updated 2 years ago
- extended benchmarking automation tool for HPC applications☆16Feb 23, 2026Updated last week
- ☆12Aug 15, 2023Updated 2 years ago
- Sparsity support for PyTorch☆38Mar 22, 2025Updated 11 months ago
- A tool for generating synthetic data.☆19Dec 19, 2025Updated 2 months ago
- C++ Hough Forests with OpenCV☆11Jul 28, 2016Updated 9 years ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆26Jun 16, 2025Updated 8 months ago
- 商品レビュープラグイン☆10Dec 12, 2025Updated 2 months ago
- A webscraper example in golang that scrapes list of projects from your Gitlab account.☆11Aug 22, 2018Updated 7 years ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆12Oct 20, 2024Updated last year
- Cloyster HPC is a turnkey HPC cluster solution with an user-friendly installer☆10Oct 2, 2025Updated 4 months ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- ☆12May 26, 2022Updated 3 years ago
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆98Jan 3, 2025Updated last year
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning☆168Nov 11, 2025Updated 3 months ago
- Class Project for 18663 - Implementation of FBNet (Hardware-Aware DNAS)☆34Oct 31, 2019Updated 6 years ago
- A simple replacement of MaxTo scripted in AutoHotKey☆10May 17, 2017Updated 8 years ago
- high performance KV database based on bitcask☆11May 12, 2023Updated 2 years ago
- Collection of Singularity build files and scripts to create them for popular Linux Distributions☆10Jun 23, 2022Updated 3 years ago
- GUI for WireSock VPN client on Windows☆14Jul 8, 2024Updated last year
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"☆12Apr 14, 2023Updated 2 years ago
- LUMI software stack: LMOD-based module setup and EasyBuild setup.☆12Updated this week
- Official Implementation of Robustifying and Boosting Training-Free Neural Architecture Search☆10Mar 12, 2024Updated last year
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- [CVPR 2025] QuartDepth☆17Mar 24, 2025Updated 11 months ago
- A novel Vietnamese dataset for evaluating handwritten text image recognition methods☆15Sep 9, 2023Updated 2 years ago
- Tool to profile usage of HPC resources by regularly probing processes.☆11Updated this week