Unofficial implementation of https://arxiv.org/pdf/2407.14679
☆53Sep 7, 2024Updated last year
Alternatives and similar repositories for Compact-Language-Models-via-Pruning-and-Knowledge-Distillation
Users that are interested in Compact-Language-Models-via-Pruning-and-Knowledge-Distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 2 months ago
- https://feedback-agent.onrender.com/☆20Mar 1, 2025Updated last year
- A tiny easily hackable implementation of a feature dashboard.☆16Oct 21, 2025Updated 7 months ago
- A family of compressed models obtained via pruning and knowledge distillation☆380Nov 6, 2025Updated 7 months ago
- D^2-MoE: Delta Decompression for MoE-based LLMs Compression☆82Mar 25, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"☆12Oct 14, 2025Updated 8 months ago
- NLP @ TU Wien☆18Dec 11, 2024Updated last year
- KDSS is the framework for knowledge distillation from LLMs☆12Nov 5, 2025Updated 7 months ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆41Sep 9, 2025Updated 9 months ago
- [ICCV W] Contextual Convolutional Neural Networks (https://arxiv.org/pdf/2108.07387.pdf)☆14Aug 18, 2021Updated 4 years ago
- Benchmarks for Macro Neural Architecture Search; used and described in the paper "Local Search is a Remarkably Strong Baseline for Neural…☆13Jul 25, 2024Updated last year
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- ☆20Aug 16, 2021Updated 4 years ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆21Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs☆43Aug 14, 2024Updated last year
- lime-ner: extending LIME for Named Entity Recognition☆10Aug 15, 2018Updated 7 years ago
- Lazy one's Flask application☆11Aug 13, 2016Updated 9 years ago
- Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"☆15Aug 2, 2025Updated 10 months ago
- Elevating Chess Strategy with Fine-Tuned Large Language Model☆18Dec 8, 2023Updated 2 years ago
- ViT architecture with Mamba instead of transformer backbone☆17Dec 8, 2023Updated 2 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆18Jun 18, 2024Updated last year
- ☆41Nov 22, 2025Updated 6 months ago
- Handwritten number recognition with STMF429 TFlite micro mnist☆31Feb 12, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)☆34Jun 14, 2025Updated last year
- NDL古典籍OCR学習用データセット(みんなで翻刻加工データ)☆20Mar 13, 2026Updated 3 months ago
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆17Mar 26, 2025Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- 音声を文字起こししてChatGPTと会話したい☆22Mar 8, 2023Updated 3 years ago
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆69Mar 27, 2025Updated last year
- The OpenAI Whisper speech-to-text model as a simple HTTP server☆14Oct 26, 2023Updated 2 years ago
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆32Mar 28, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AI powered Virtual Desktop☆16Jun 7, 2026Updated last week
- ☆16Apr 2, 2025Updated last year
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆25Oct 11, 2025Updated 8 months ago
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23☆23Apr 25, 2023Updated 3 years ago
- It checks how secure the program you made is and shows how vulnerable your program is.☆20Apr 20, 2017Updated 9 years ago
- ☆18Apr 30, 2025Updated last year
- ☆14Dec 12, 2024Updated last year