Aaquib111 / Sparse-GPT-FinetuningView external linksLinks
Code for my ICLR 2024 TinyPapers paper "Prune and Tune: Improving Efficient Pruning Techniques for Massive Language Models"
☆16May 26, 2023Updated 2 years ago
Alternatives and similar repositories for Sparse-GPT-Finetuning
Users that are interested in Sparse-GPT-Finetuning are comparing it to the libraries listed below
Sorting:
- Mamba support for transformer lens☆19Sep 17, 2024Updated last year
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆22Nov 18, 2024Updated last year
- Pytorch code for paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models☆25Sep 27, 2023Updated 2 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆33Aug 14, 2024Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Mar 7, 2025Updated 11 months ago
- Simplification of pruned models for accelerated inference | SoftwareX https://doi.org/10.1016/j.softx.2021.100907☆36Feb 25, 2025Updated 11 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆39Mar 11, 2024Updated last year
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"☆47May 31, 2024Updated last year
- ☆53May 19, 2025Updated 8 months ago
- The example of Ionic 3 Angular 5 search and sort list of data☆10Dec 18, 2017Updated 8 years ago
- MCP server for GNU Radio☆30Jan 5, 2026Updated last month
- ☆11Aug 20, 2025Updated 5 months ago
- ☆66Jul 8, 2025Updated 7 months ago
- OpenCV Text Detection (EAST text detector)☆12Jul 15, 2019Updated 6 years ago
- Effective Attention Sheds Light On Interpretability - Findings of ACL2021☆11May 16, 2021Updated 4 years ago
- A simple script to add pdf-files to Zotero via CLI☆12May 17, 2020Updated 5 years ago
- Everything you need to reproduce "Better plain ViT baselines for ImageNet-1k" in PyTorch, and more☆12Feb 3, 2026Updated last week
- ☆11Nov 28, 2025Updated 2 months ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…☆12Jun 16, 2023Updated 2 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated 10 months ago
- An app to view 360 degree videos☆10Jun 26, 2017Updated 8 years ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- BH hackathon☆14Apr 4, 2024Updated last year
- Python Data Controller for Neural EEG headsets. (Windows + Linux)☆12Dec 27, 2017Updated 8 years ago
- ☆13Apr 10, 2025Updated 10 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Jan 30, 2026Updated 2 weeks ago
- ☆11Dec 5, 2025Updated 2 months ago
- Install Wireguard systemlessly☆11Dec 27, 2017Updated 8 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- ☆14Jan 24, 2025Updated last year
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Jan 2, 2024Updated 2 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 6 months ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 6 months ago
- Improving transparency of large language models' reasoning☆14Nov 25, 2025Updated 2 months ago
- An implementation of Google places autocomplete in Angular 6☆14Feb 21, 2019Updated 6 years ago
- "Causality: Models, Reasoning, and Inference-Judea Pearl(2009)"中文翻译及学习笔记☆15Feb 18, 2022Updated 3 years ago
- MCP server for controlling ThreeJs source code, only basic function☆21Mar 23, 2025Updated 10 months ago
- ☆12Oct 9, 2023Updated 2 years ago