Block Sparse movement pruning
☆83Nov 26, 2020Updated 5 years ago
Alternatives and similar repositories for block_movement_pruning
Users that are interested in block_movement_pruning are comparing it to the libraries listed below
Sorting:
- Prune a model while finetuning or training.☆406Jun 21, 2022Updated 3 years ago
- ☆17May 14, 2020Updated 5 years ago
- [KDD'22] Learned Token Pruning for Transformers☆101Feb 27, 2023Updated 3 years ago
- Streamlit, but better.☆16Feb 5, 2024Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 4 years ago
- Data for the ACL SRW 2020 paper "Understanding Points of Correspondence between Sentences for Abstractive Summarization"☆20Nov 2, 2022Updated 3 years ago
- [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers☆192Feb 28, 2023Updated 3 years ago
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)☆27Apr 27, 2020Updated 5 years ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14May 26, 2024Updated last year
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- ☆13Aug 28, 2018Updated 7 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆29Jul 23, 2021Updated 4 years ago
- ☆13Nov 4, 2025Updated 3 months ago
- 基于自由度(熵)、凝固度 新词发现算法实现☆12Oct 7, 2018Updated 7 years ago
- Pytorch implementation of our paper (TNNLS) -- Pruning Networks with Cross-Layer Ranking & k-Reciprocal Nearest Filters☆12Feb 24, 2022Updated 4 years ago
- ☆12May 22, 2022Updated 3 years ago
- Companion code to https://arxiv.org/abs/2409.03797v2☆19Sep 18, 2025Updated 5 months ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA☆16May 11, 2022Updated 3 years ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"☆15Mar 6, 2025Updated 11 months ago
- Code for the paper "Are Sixteen Heads Really Better than One?"☆175Apr 1, 2020Updated 5 years ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆19Jun 13, 2025Updated 8 months ago
- [ECCV 2024] Code for the paper "Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network"☆17Jul 27, 2024Updated last year
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆20Jan 8, 2026Updated last month
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- Code for Episodic Memory Reader (EMR) https://arxiv.org/abs/1903.06164☆15Nov 16, 2022Updated 3 years ago
- Test code of Inverse cloze task for information retrieval☆33Jan 10, 2021Updated 5 years ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Jan 15, 2024Updated 2 years ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 10 months ago
- 国内外数据竞赛资讯整理☆18Nov 6, 2021Updated 4 years ago
- Hugging Face and Pyserini interoperability☆19May 18, 2023Updated 2 years ago
- GAU-alpha-pytorch☆20May 11, 2022Updated 3 years ago
- Python Template Repository☆19Updated this week
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?☆16Aug 8, 2023Updated 2 years ago
- Official code of our work, PolicyQA: A Reading Comprehension Dataset for Privacy Policies [Findings of EMNLP 2020].☆16Nov 8, 2021Updated 4 years ago
- ☆18Nov 6, 2019Updated 6 years ago
- Fast Block Sparse Matrices for Pytorch☆549Jan 21, 2021Updated 5 years ago