A selective knowledge distillation algorithm for efficient speculative decoders
☆40Nov 27, 2025Updated 6 months ago
Alternatives and similar repositories for adaspec
Users that are interested in adaspec are comparing it to the libraries listed below.
- Official Implementation of DART (DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference).☆60Feb 8, 2026Updated 4 months ago
- An AlphaZero engine for Saiblo Connect4, featuring a pure Python implementation of key KataGo techniques.☆18Apr 21, 2026Updated last month
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated 3 months ago
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆27Feb 21, 2025Updated last year
- ☆11Dec 26, 2025Updated 5 months ago
- Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025)☆19Jul 1, 2025Updated 11 months ago
- ☆20Dec 24, 2024Updated last year
- ☆12May 19, 2022Updated 4 years ago
- Code for "Accelerating Transformer Pre-training with 2:4 Sparsity"☆27Dec 8, 2024Updated last year
- [ICML 2025] Generalization Principles for Inference over Text-Attributed Graphs with Large Language☆22Jul 15, 2025Updated 10 months ago
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆76Mar 10, 2026Updated 3 months ago
- PyTorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆40May 4, 2026Updated last month
- ☆13Dec 9, 2024Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆22Dec 16, 2024Updated last year
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆25Dec 2, 2025Updated 6 months ago
- A TensorFlow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…☆10Dec 18, 2019Updated 6 years ago
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring☆281Jul 6, 2025Updated 11 months ago
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆65Mar 25, 2025Updated last year
- An LLM inference engine, written in C++☆20Mar 30, 2026Updated 2 months ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- Code for ICML 2021 submission☆35Mar 24, 2021Updated 5 years ago
- A buyer's checklist guide for purchasing your new BYD car☆10Feb 18, 2024Updated 2 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆16Sep 8, 2022Updated 3 years ago
- ☆60Feb 28, 2026Updated 3 months ago
- [NAACL 2025 Main Conference] PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization☆27Mar 29, 2025Updated last year
- Code needed to reproduce results from my ICLR 2019 paper on fixed-point quantization of the backprop algorithm.☆10Jan 24, 2019Updated 7 years ago
- PyTorch code for full quantization of DNN using BCGD☆14Jul 24, 2019Updated 6 years ago
- Streaming video and audio using HLS and FLV.☆11Feb 22, 2022Updated 4 years ago
- 303 Chinese-language AI/LLM lecture notes with online reading, PDF download, and LaTeX source viewing | Stanford CS336/CS224R/CS25 | Berkeley LLM Agents | Agent engineering practice☆119May 25, 2026Updated 2 weeks ago
- ☆19Dec 4, 2025Updated 6 months ago
- Adaptation of Superpower in the ML field☆117May 18, 2026Updated 3 weeks ago
- ☆11Dec 8, 2022Updated 3 years ago
- KimiaPath24: Dataset for retrieval and classification in digital pathology☆13Jun 4, 2017Updated 9 years ago
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆151Dec 4, 2024Updated last year
- Helper methods for Pandas Series and DataFrames to numerically compute derivatives and integrals☆11Jun 7, 2019Updated 7 years ago
- Example code for Cytron Raspberry Pi 10A Motor Driver Duo Hat☆12Feb 22, 2023Updated 3 years ago
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"☆19Jul 11, 2024Updated last year
- ☆137Feb 17, 2026Updated 3 months ago
- Empatica Documentation☆12Feb 18, 2025Updated last year