[ICLR25] STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
☆20Jun 3, 2025Updated 10 months ago
Alternatives and similar repositories for STBLLM
Users that are interested in STBLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Feb 5, 2024Updated 2 years ago
- ☆33Dec 10, 2025Updated 4 months ago
- The official implementation of the ICML 2023 paper OFQ-ViT☆39Oct 3, 2023Updated 2 years ago
- General Image Classification Code base☆22Jul 12, 2021Updated 4 years ago
- [ICCAD 2025] Squant☆15Jul 3, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- D^2-MoE: Delta Decompression for MoE-based LLMs Compression☆82Mar 25, 2025Updated last year
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…☆11Mar 3, 2024Updated 2 years ago
- Imitation Learning; Robotics; Policy; VLA;☆35Apr 24, 2026Updated last week
- ☆11Dec 19, 2023Updated 2 years ago
- Work in progress.☆80Nov 25, 2025Updated 5 months ago
- ☆36Apr 22, 2026Updated last week
- [ACL 2025] RealHiTBench: A Comprehensive Realistic Hierarchical Table Benchmark for Evaluating LLM-Based Table Analysis☆25Aug 8, 2025Updated 8 months ago
- ☆20Apr 19, 2021Updated 5 years ago
- [ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt…☆92Apr 8, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆16Nov 22, 2022Updated 3 years ago
- This is the official repo for the ICML 2025 paper "Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization" Tang et al☆20Jun 8, 2025Updated 10 months ago
- [ICLR'25] ARB-LLM: Alternating Refined Binarizations for Large Language Models☆29Aug 5, 2025Updated 8 months ago
- TAGMol: Target-Aware Gradient-guided Molecule Generation (ICML'24 ML4LMS Workshop)☆14Aug 29, 2024Updated last year
- [ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"☆33Jul 24, 2022Updated 3 years ago
- [TECS'23] A project on the co-design of Accelerators and CNNs.☆21Dec 10, 2022Updated 3 years ago
- PyTorch implements `MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications` paper.☆16May 25, 2023Updated 2 years ago
- ☆17Nov 27, 2023Updated 2 years ago
- The official implementation of the paper "Data Contamination Calibration for Black-box LLMs" (ACL 2024)☆16May 21, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DCPO: Dynamic Adaptive Clipping for RL☆48Apr 1, 2026Updated last month
- Tensorflow implementation of the TGRS paper entitled Oil Spill Segmentation via Adversarial f-Divergence Learning.☆12Mar 9, 2019Updated 7 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆19Dec 26, 2025Updated 4 months ago
- Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib…☆23Mar 29, 2025Updated last year
- ☆21Apr 2, 2024Updated 2 years ago
- On Uncertainty, Tempering, and Data Augmentation in Bayesian Classification☆21Apr 1, 2022Updated 4 years ago
- Experimental deep learning framework written in Rust☆15Nov 2, 2022Updated 3 years ago
- This is an implementation of YOLO using LSQ network quantization method.☆22Apr 13, 2022Updated 4 years ago
- Code for "Accelerating Transformer Pre-training with 2:4 Sparsity"☆27Dec 8, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization (ICML 2024)☆19Apr 6, 2025Updated last year
- [ACM MM 2023] Official code for "TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation"☆23Dec 3, 2025Updated 4 months ago
- ☆26Mar 1, 2024Updated 2 years ago
- Distilling BERT using natural language generation.☆39Aug 13, 2023Updated 2 years ago
- List of papers related to neural network quantization in recent AI conferences and journals.☆820Mar 27, 2025Updated last year
- Implementation of NeurIPS 2019 paper "Normalization Helps Training of Quantized LSTM"☆31Jul 25, 2024Updated last year
- ☆17Aug 6, 2023Updated 2 years ago