mitchellgordon95/bert-prune

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mitchellgordon95/bert-prune)

mitchellgordon95 / bert-prune

☆17

Alternatives and similar repositories for bert-prune

Users that are interested in bert-prune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

heartcored98 / transformer_anatomy
View on GitHub
Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020
☆16Mar 21, 2025Updated last year
llyx97 / Rosita
View on GitHub
[AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan
☆14Oct 18, 2022Updated 3 years ago
ttambe / AdaptivFloat
View on GitHub
Adaptive floating-point based numerical format for resilient deep learning
☆14Apr 11, 2022Updated 4 years ago
dodgejesse / show_your_work
View on GitHub
☆11Jan 21, 2020Updated 6 years ago
huggingface / block_movement_pruning
View on GitHub
Block Sparse movement pruning
☆83Nov 26, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
fpgadeveloper / ethernet-fmc-processorless
View on GitHub
Example designs for using Ethernet FMC without a processor (ie. state machine based)
☆35Jul 7, 2026Updated 2 weeks ago
neubig / rapid-adaptation
View on GitHub
Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages"
☆39Aug 7, 2018Updated 7 years ago
DomHudson / bert-in-production
View on GitHub
A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 ) and related Language Models in production environments.
☆96Apr 8, 2021Updated 5 years ago
VITA-Group / BERT-Tickets
View on GitHub
[NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Ya…
☆141Dec 30, 2021Updated 4 years ago
DerrickYLJ / TidalDecode
View on GitHub
[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
☆57Aug 6, 2025Updated 11 months ago
mimno / PyMallet
View on GitHub
Python tools for text
☆16May 8, 2020Updated 6 years ago
LinuxSuRen / yaml-readme
View on GitHub
A helper to generate the READE file automatically from YAML-based metadata files.
☆19May 23, 2024Updated 2 years ago
bdhingra / coref-gru
View on GitHub
Model for processing text sequences with coreference annotations
☆14Nov 29, 2018Updated 7 years ago
forkonlp / newspaper
View on GitHub
대부분의 신문사 뉴스를 수집하는 것을 목적으로 하는 크롤러 제작 프로젝트
☆11Jul 29, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
BeckyMarvin / LM_syneval
View on GitHub
Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.
☆44Apr 25, 2020Updated 6 years ago
keris2020 / hackathon
View on GitHub
☆10Nov 6, 2020Updated 5 years ago
gao-xiao-bai / JsonTuning
View on GitHub
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
☆10Nov 3, 2024Updated last year
dmoltisanti / air-cvpr23
View on GitHub
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13May 25, 2023Updated 3 years ago
lsvih / AtTGen
View on GitHub
Code for "AtTGen: Attribute Tree Generation for Real-World Attribute Joint Extraction", ACL 2023
☆13May 19, 2023Updated 3 years ago
zomux / lanmt-ebm
View on GitHub
lanmt ebm
☆12Jun 19, 2020Updated 6 years ago
MAC-AutoML / YOCO-BERT
View on GitHub
The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Natu…
☆48Jul 1, 2021Updated 5 years ago
iesl / leopard
View on GitHub
☆24Nov 27, 2020Updated 5 years ago
IST-DASLab / sparse-imagenet-transfer
View on GitHub
Code for reproducing the results in "How Well do Sparse Imagenet Models Transfer?", presented at CVPR 2022
☆10Jun 3, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pufanyi / syphus
View on GitHub
Syphus: Automatic Instruction-Response Generation Pipeline
☆14Dec 14, 2023Updated 2 years ago
rtaori / data_feedback
View on GitHub
Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"
☆18Sep 9, 2022Updated 3 years ago
roeeaharoni / string-to-tree-nmt
View on GitHub
Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"
☆16Dec 31, 2017Updated 8 years ago
OHDSI / MedlineXmlToDatabase
View on GitHub
A command line Java application for parsing MEDLINE XML files and inserting the data into a relational database
☆19Aug 9, 2023Updated 2 years ago
zerodocom / ChatGPT-Plus
View on GitHub
☆16Feb 20, 2023Updated 3 years ago
faviq / faviq
View on GitHub
FaVIQ: Fact Verification from Information-seeking Questions
☆43Nov 23, 2022Updated 3 years ago
tissue3 / EyerissSimulator
View on GitHub
Eyeriss chip simulator
☆41Mar 6, 2020Updated 6 years ago
HydraQYH / expert_specialization_moe
View on GitHub
Expert Specialization MoE Solution based on CUTLASS
☆27Apr 14, 2026Updated 3 months ago
uclnlp / APE
View on GitHub
Adaptive Passage Encoder for Open-domain Question Answering
☆15Jun 1, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
brucespang / sammy
View on GitHub
Smoothing video traffic to make it a friendlier internet neighbor
☆14Apr 23, 2024Updated 2 years ago
kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
DFKI-NLP / RelEx
View on GitHub
RelEx - A simple framework for Relation Extraction built on AllenNLP
☆15Jun 17, 2020Updated 6 years ago
pmichel31415 / are-16-heads-really-better-than-1
View on GitHub
Code for the paper "Are Sixteen Heads Really Better than One?"
☆175Apr 1, 2020Updated 6 years ago
appl-lab / CuTS
View on GitHub
☆13Sep 8, 2021Updated 4 years ago
RedHenLab / ASR-for-Chinese-Pipeline
View on GitHub
Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese
☆10Jan 11, 2019Updated 7 years ago
SymbioticLab / tensorflow-salus
View on GitHub
tensorflow fork with Salus integration
☆12Jan 7, 2022Updated 4 years ago