[ICLR25] STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
☆20Jun 3, 2025Updated 10 months ago
Alternatives and similar repositories for STBLLM
Users that are interested in STBLLM are comparing it to the libraries listed below.
- ☆21Feb 5, 2024Updated 2 years ago
- 🎓Automatically update circult-eda-mlsys-tinyml papers daily using GitHub Actions (updates every 8 hours)☆10Updated this week
- The official implementation of the ICML 2023 paper OFQ-ViT☆39Oct 3, 2023Updated 2 years ago
- General Image Classification Code base☆22Jul 12, 2021Updated 4 years ago
- High-performance sparse mask attention CUDA operator for short sequences, optimized for sequences under 1K with 75% sparsity☆67Mar 18, 2026Updated 3 weeks ago
- [ICCAD 2025] Squant☆15Jul 3, 2025Updated 9 months ago
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"☆13Jun 7, 2023Updated 2 years ago
- HFODetector is a Python package that is capable of detecting HFOs with STE / MNI / Hilbert detector. Detection speed is increased by u…☆13Feb 16, 2025Updated last year
- D^2-MoE: Delta Decompression for MoE-based LLMs Compression☆79Mar 25, 2025Updated last year
- Learning-Recurrent-Binary-Ternary-Weights☆13Dec 4, 2018Updated 7 years ago
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…☆11Mar 3, 2024Updated 2 years ago
- ☆75Mar 26, 2026Updated 2 weeks ago
- Work in progress.☆79Nov 25, 2025Updated 4 months ago
- ☆21Nov 26, 2025Updated 4 months ago
- ☆35Updated this week
- [ACL 2025] RealHiTBench: A Comprehensive Realistic Hierarchical Table Benchmark for Evaluating LLM-Based Table Analysis☆25Aug 8, 2025Updated 8 months ago
- ☆20Apr 19, 2021Updated 4 years ago
- [ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt…☆90Apr 8, 2025Updated last year
- TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference☆14Feb 22, 2022Updated 4 years ago
- ☆16Nov 22, 2022Updated 3 years ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆21Jun 16, 2024Updated last year
- ☆19Dec 10, 2021Updated 4 years ago
- [NeurIPS 2023] Token-Scaled Logit Distillation for Ternary Weight Generative Language Models☆18Dec 6, 2023Updated 2 years ago
- [ICLR'25] ARB-LLM: Alternating Refined Binarizations for Large Language Models☆28Aug 5, 2025Updated 8 months ago
- STE/MNI/HIL HFO/Spindle detection, classification and visualization☆17Updated this week
- [ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"☆32Jul 24, 2022Updated 3 years ago
- PyTorch implementation of the `MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications` paper.☆17May 25, 2023Updated 2 years ago
- The official implementation of the paper "Data Contamination Calibration for Black-box LLMs" (ACL 2024)☆16May 21, 2024Updated last year
- DCPO: Dynamic Adaptive Clipping for RL☆48Apr 1, 2026Updated last week
- Tensorflow implementation of the TGRS paper entitled Oil Spill Segmentation via Adversarial f-Divergence Learning.☆12Mar 9, 2019Updated 7 years ago
- hardware & software prefetcher☆30Dec 21, 2023Updated 2 years ago
- ☆81Jul 21, 2022Updated 3 years ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆18Jun 1, 2024Updated last year
- Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib…☆23Mar 29, 2025Updated last year
- PyTorch Implementation of Visual GAIL in Atari Games☆14Dec 7, 2022Updated 3 years ago
- Branch predictor simulation framework for the Last-Level Branch Predictor☆36Dec 3, 2025Updated 4 months ago
- ☆21Apr 2, 2024Updated 2 years ago
- On Uncertainty, Tempering, and Data Augmentation in Bayesian Classification☆21Apr 1, 2022Updated 4 years ago
- This is an implementation of YOLO using LSQ network quantization method.☆22Apr 13, 2022Updated 3 years ago