A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.
β63Feb 18, 2026Updated last month
Alternatives and similar repositories for llm-pruning-collection
Users that are interested in llm-pruning-collection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25 π SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expertβ¦β16Feb 4, 2025Updated last year
- β16Updated this week
- Code for "What really matters in matrix-whitening optimizers?"β23Oct 31, 2025Updated 4 months ago
- ThinkGen: Generalized Thinking for Visual Generationβ52Dec 30, 2025Updated 3 months ago
- β62Updated this week
- NordVPN Special Discount Offer β’ AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Modelsβ26Mar 26, 2025Updated last year
- β13Aug 17, 2020Updated 5 years ago
- World-Gymnast: Training Robots with Reinforcement Learning in a World Modelβ30Feb 11, 2026Updated last month
- Official implementation of Categorical Flow Maps on text.β49Feb 16, 2026Updated last month
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewardsβ37Oct 3, 2025Updated 5 months ago
- β21Dec 3, 2025Updated 3 months ago
- PeRL: Parameter-Efficient Reinforcement Learningβ73Mar 10, 2026Updated 2 weeks ago
- β111Feb 19, 2026Updated last month
- β32Mar 13, 2026Updated 2 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Inverse Scaling in Test-Time Computeβ25Dec 3, 2025Updated 3 months ago
- β11Dec 13, 2023Updated 2 years ago
- Calculate mean of pairwise weighted distances between points using great circle metric.β11Jul 6, 2023Updated 2 years ago
- RePo: Language Models with Context Re-Positioningβ74Dec 24, 2025Updated 3 months ago
- A Portfolio Theme for Jekyllβ12Nov 12, 2021Updated 4 years ago
- [CVPR 2026 (Findings) π₯π₯] Self Evolving Large Multimodal Models with Continuous Rewardsβ21Mar 5, 2026Updated 3 weeks ago
- A collection of statistical functions writtten in pythonβ13Oct 29, 2021Updated 4 years ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictorβ35Oct 13, 2025Updated 5 months ago
- Materials for the 2017 QMSS Python Workshopβ12Jun 22, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Scripts and notebooks for models that enable better causal inferences in psychological sciencesβ11Dec 24, 2024Updated last year
- β15Mar 13, 2026Updated 2 weeks ago
- Contains the matrix generation software and normed matrices described in "Recreating Raven's: Software for systematically generating largβ¦β15Dec 4, 2023Updated 2 years ago
- Tools for computational psychiatry research.β11Dec 8, 2024Updated last year
- Practical Python exercises on classical computer vision and clean engineering practicesβ26Apr 30, 2025Updated 11 months ago
- Code related to data for the Item Response Warehouseβ11Mar 22, 2026Updated last week
- Quartet II Official Codeβ63Mar 23, 2026Updated last week
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/β¦β10Jun 21, 2023Updated 2 years ago
- The official repo of VideoAgentTrekβ47Oct 24, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- BFAST3D: Bayesian Fast Accurate Spatial Tricks in 3D. For fMRI analysis.β11Sep 30, 2020Updated 5 years ago
- Training tiny models to prove hard theoremsβ64Mar 5, 2026Updated 3 weeks ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".β53Dec 28, 2025Updated 3 months ago
- Memory efficient one-hot array encodingsβ20May 29, 2025Updated 10 months ago
- Evaluation kit for testing stateful agentsβ61Mar 19, 2026Updated last week
- [KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generationsβ152Updated this week
- β37Dec 16, 2025Updated 3 months ago