Block Sparse movement pruning
☆83Nov 26, 2020Updated 5 years ago
Alternatives and similar repositories for block_movement_pruning
Users that are interested in block_movement_pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Prune a model while finetuning or training.☆406Jun 21, 2022Updated 3 years ago
- [NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Ya…☆142Dec 30, 2021Updated 4 years ago
- [KDD'22] Learned Token Pruning for Transformers☆98Feb 27, 2023Updated 3 years ago
- Streamlit, but better.☆16Feb 5, 2024Updated 2 years ago
- Data for the ACL SRW 2020 paper "Understanding Points of Correspondence between Sentences for Abstractive Summarization"☆20Nov 2, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers☆193Feb 28, 2023Updated 3 years ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆23Jun 13, 2025Updated 10 months ago
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- ☆13Aug 28, 2018Updated 7 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆25Feb 24, 2023Updated 3 years ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆29Jul 23, 2021Updated 4 years ago
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its…☆21Sep 10, 2024Updated last year
- Encode-attend-navigate unofficial Pytorch implementation☆12Oct 1, 2024Updated last year
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Dependency parser on Thai language☆26Jan 25, 2025Updated last year
- PyTorch implementation of OpenAI's REPTILE Algorithm☆26May 8, 2018Updated 7 years ago
- 基于自由度(熵)、凝固度 新词发现算法实现☆12Oct 7, 2018Updated 7 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- [CVPR '23 Highlight] Official repository for the paper "Quantum Multi-Model Fitting".☆11Mar 7, 2025Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆43Jan 15, 2024Updated 2 years ago
- Official repo of the paper “AL-GTD: Deep Active Learning for Gaze Target Detection” (ACMMM2024)☆12Nov 29, 2024Updated last year
- [ECCV 2024] Code for the paper "Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network"☆18Jul 27, 2024Updated last year
- Official code of our work, PolicyQA: A Reading Comprehension Dataset for Privacy Policies [Findings of EMNLP 2020].☆16Nov 8, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data☆57Aug 5, 2021Updated 4 years ago
- A generic code base for neural network pruning, especially for pruning at initialization.☆32Sep 3, 2022Updated 3 years ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA☆16May 11, 2022Updated 3 years ago
- Code for the paper "Are Sixteen Heads Really Better than One?"☆175Apr 1, 2020Updated 6 years ago
- Collaborative retina modelling across datasets and species.☆19Apr 10, 2026Updated 2 weeks ago
- An NMT framework built on Joint Representation☆12Feb 19, 2020Updated 6 years ago
- Pytorch implementation of our paper (TNNLS) -- Pruning Networks with Cross-Layer Ranking & k-Reciprocal Nearest Filters☆12Feb 24, 2022Updated 4 years ago
- Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)☆19Jul 28, 2021Updated 4 years ago
- ☆19Jun 26, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for Episodic Memory Reader (EMR) https://arxiv.org/abs/1903.06164☆15Nov 16, 2022Updated 3 years ago
- Pytorch implementation of the paper "SNIP: Single-shot Network Pruning based on Connection Sensitivity" by Lee et al.☆110Apr 23, 2019Updated 7 years ago
- [ECCV18] Constraint-Aware Deep Neural Network Compression☆12Sep 11, 2018Updated 7 years ago
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…☆88Dec 1, 2023Updated 2 years ago
- ☆24May 1, 2025Updated 11 months ago
- 2SSP: A Two-Stage Framework for Structured Pruning of LLMs☆21Aug 18, 2025Updated 8 months ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48May 25, 2022Updated 3 years ago