bzantium/pytorch-PKD-for-BERT-compression

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bzantium/pytorch-PKD-for-BERT-compression)

bzantium / pytorch-PKD-for-BERT-compression

☆15

Alternatives and similar repositories for pytorch-PKD-for-BERT-compression

Users that are interested in pytorch-PKD-for-BERT-compression are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JetRunner / MetaDistil
View on GitHub
Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".
☆86Aug 4, 2022Updated 3 years ago
intersun / PKD-for-BERT-Model-Compression
View on GitHub
pytorch implementation for Patient Knowledge Distillation for BERT Model Compression
☆203Sep 20, 2019Updated 6 years ago
PaddlePaddle / Perf
View on GitHub
SOTA benchmark
☆18Aug 8, 2023Updated 2 years ago
JindongGu / SimDis
View on GitHub
A pytorch implementation of the ICCV2021 workshop paper SimDis: Simple Distillation Baselines for Improving Small Self-supervised Models
☆14Jul 15, 2021Updated 5 years ago
wanggangkun / ST-Text-GCN
View on GitHub
Code for paper "Self-training Method Based on GCN for Semi-supervised Short Text Classification"
☆11Oct 30, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
BM-K / Troll-Detector
View on GitHub
Troll Detector
☆15Nov 28, 2022Updated 3 years ago
nhthang9x / HAN-Text-Classification-Pytorch
View on GitHub
My pytorch implementation of the model described in the paper **Hierarchical Attention Networks for Document Classification** [paper](htt…
☆10Mar 22, 2019Updated 7 years ago
BM-K / KoDiffCSE
View on GitHub
Difference-based Contrastive Learning for Korean Sentence Embeddings
☆23Mar 11, 2026Updated 4 months ago
MichaelZhouwang / Sequence_Span_Rewriting
View on GitHub
Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
☆17Nov 30, 2021Updated 4 years ago
luofuli / A-Review-of-Text-Style-Transfer
View on GitHub
Text Style Transfer: A Review
☆13Jun 1, 2019Updated 7 years ago
JetRunner / BERT-of-Theseus
View on GitHub
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
☆316Jun 12, 2023Updated 3 years ago
liucongg / 2021-GAIIC-Task3-Share
View on GitHub
全球人工智能技术创新大赛-赛道三：小布助手对话短文本语义匹配
☆38Mar 29, 2021Updated 5 years ago
RuifengYuan / FactExsum-coling2020
View on GitHub
Code for Fact-level Extractive Summarization with Hierarchical Graph Mask on BERT (coling 2020)
☆16Mar 25, 2023Updated 3 years ago
shawnricecake / squant
View on GitHub
[ICCAD 2025] Squant
☆15Jul 3, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
alycialee / beyond-scale-language-data-diversity
View on GitHub
☆13Jul 22, 2026Updated last week
multimodal-art-projection / CodeCriticBench
View on GitHub
☆16Nov 1, 2025Updated 8 months ago
EmbraceLife / My_Journey_on_Kaggle
View on GitHub
Learn from amazing Kagglers on Kaggle
☆12Feb 26, 2023Updated 3 years ago
wyu-du / Controlled-Dialogue-Generation
View on GitHub
This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…
☆12Dec 1, 2021Updated 4 years ago
ZKI-PH-ImageAnalysis / Next-Generation-Loss
View on GitHub
☆12Jan 8, 2025Updated last year
JackShDr / InfluentialRS
View on GitHub
Implementations of Influential Recommender System
☆12Oct 29, 2024Updated last year
sfzhou5678 / Lancer
View on GitHub
Code for paper "Lancer: Your Code Tell Me What You Need"
☆11Jun 17, 2022Updated 4 years ago
TobeyYang / S2S_Temp
View on GitHub
Code for EMNLP2019 paper "Low-Resource Response Generation with Template Prior"
☆12Jan 17, 2020Updated 6 years ago
StigLidu / TURN
View on GitHub
[ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"
☆23Feb 16, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gcosne / OceanographyProject
View on GitHub
Today satellites provide a surface signature of the temperature with a high spatial frequency: ie a good horizontal resolution but a low …
☆13Oct 23, 2019Updated 6 years ago
ShubhayanS / Detecting-Cyclone-Centers-Custom-YOLOv3
View on GitHub
TROPICAL cyclones (TCs) are intense warm-corded cyclonic vortices, developed from low-pressure systems over the tropical oceans and drive…
☆11Sep 1, 2020Updated 5 years ago
Guillem96 / data2vec-vision
View on GitHub
PyTorch implementation of Data2Vec self-supervised approach for vision use cases.
☆18Oct 7, 2022Updated 3 years ago
ayushidalmia / Phrase-Based-Model
View on GitHub
Implementation of Phrase Based Model to translate sentences from English to German and vice versa
☆12May 23, 2014Updated 12 years ago
chenghuige / feedback-prize-2021
View on GitHub
9th solution
☆11Oct 11, 2022Updated 3 years ago
amanchadha / LeetCode-Solutions
View on GitHub
(Weekly Update) Python / Modern C++ Solutions of All 1643 LeetCode Problems
☆13Nov 3, 2020Updated 5 years ago
uservan / ThinkPO
View on GitHub
☆17Aug 1, 2025Updated 11 months ago
RunxinXu / ContrastivePruning
View on GitHub
Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》
☆25Dec 15, 2021Updated 4 years ago
murray-z / knowledge_distillation
View on GitHub
knowledge distillation: 采用知识蒸馏，训练bert后指导textcnn
☆19Apr 29, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
MSWon / Transformer-Encoder-with-Char
View on GitHub
Transformer Encoder with Char information for text classification
☆15Jan 17, 2020Updated 6 years ago
cycloneintensity / CrossKnotHacks-Cyclonet
View on GitHub
CycloNet is a Deep Learning based web-app for Cyclone intensity computation using INSAT-3D Cyclone Imagery
☆13Sep 17, 2023Updated 2 years ago
AndrewMishchenko / sqltomongo
View on GitHub
Translates SQL queries to MongoDB queries.
☆11Jun 15, 2017Updated 9 years ago
gsh199449 / proto-summ
View on GitHub
Dataset proposed by ''How to Write Summaries with Patterns? Learning towards Abstractive Summarization through Prototype Editing''
☆18May 4, 2021Updated 5 years ago
snu-mllab / DisentanglementICML19
View on GitHub
"Learning Discrete and Continuous Factors of Data via Alternating Disentanglement" accepted at ICML2019
☆22Aug 22, 2019Updated 6 years ago
tau-nlp / zero_scrolls
View on GitHub
Running inference on the ZeroSCROLLS benchmark
☆22Apr 18, 2024Updated 2 years ago
castorini / d-bert
View on GitHub
Distilling BERT using natural language generation.
☆39Aug 13, 2023Updated 2 years ago