cliang1453/super-structured-lottery-tickets

Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)

☆19

Alternatives and similar repositories for super-structured-lottery-tickets

Users that are interested in super-structured-lottery-tickets are comparing it to the libraries listed below.

llyx97 / TAMT
[NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…
☆15 · Oct 18, 2022 · Updated 3 years ago
cliang1453 / SAGE
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)
☆29 · Feb 9, 2022 · Updated 4 years ago
Lingkai-Kong / Calibrated-BERT-Fine-Tuning
Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
☆36 · Nov 16, 2020 · Updated 5 years ago
QingruZhang / PLATON
This pytorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).
☆45 · Oct 17, 2022 · Updated 3 years ago
evasharma / bigpatent
☆25 · Jun 25, 2019 · Updated 7 years ago
llyx97 / Rosita
[AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan
☆14 · Oct 18, 2022 · Updated 3 years ago
yuna970129 / denoise-ECG-signal-by-GAN
☆11 · Apr 19, 2021 · Updated 5 years ago
SprocketLab / Alchemist
☆12 · Mar 4, 2025 · Updated last year
VITA-Group / BERT-Tickets
[NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Ya…
☆141 · Dec 30, 2021 · Updated 4 years ago
sai-prasanna / bert-experiments
☆19 · Oct 6, 2020 · Updated 5 years ago
text-machine-lab / adversarial_decomposition
The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019
☆29 · Dec 8, 2022 · Updated 3 years ago
rsvp-ai / segatron_aaai
codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"
☆18 · Oct 25, 2022 · Updated 3 years ago
NLP-Playground / LaSS
☆31 · Apr 27, 2022 · Updated 4 years ago
Form2Seq-Data / Dataset
Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"
☆10 · Feb 17, 2021 · Updated 5 years ago
lalalamdbf / PLSE_IDRR
The Code for the EMNLP 2023 main conference paper "Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition…
☆13 · Dec 10, 2023 · Updated 2 years ago
amzn / amazon-weak-ner-needle
Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data
☆101 · Jul 25, 2023 · Updated 3 years ago
TeamLab / pdcde2018
Full paper available on Researchgate
☆17 · Oct 21, 2018 · Updated 7 years ago
SHI-Labs / DiSparse-Multitask-Model-Compression
[CVPR 2022] DiSparse: Disentangled Sparsification for Multitask Model Compression
☆14 · Sep 6, 2022 · Updated 3 years ago
cordercorder / nmt-multi
Codebase for multilingual neural machine translation
☆13 · Nov 24, 2022 · Updated 3 years ago
ohlionel / Prune-Tune
Official code repository for AAAI2021 paper Finding Sparse Structures for Domain Specific Neural Machine Translation
☆11 · Apr 1, 2021 · Updated 5 years ago
cbaziotis / lm-prior-for-nmt
This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"
☆43 · Mar 16, 2021 · Updated 5 years ago
ntunlp / ptrnet-depparser
☆11 · Oct 13, 2019 · Updated 6 years ago
varun19299 / rigl-reproducibility
Reproducing RigL (ICML 2020) as a part of ML Reproducibility Challenge 2020
☆29 · Jan 6, 2022 · Updated 4 years ago
NLP2CT / norm-nmt
Norm-Based Curriculum Learning for Neural Machine Translation (ACL 2020)
☆18 · Aug 1, 2020 · Updated 5 years ago
popfido / PairCNN-Ranking
☆13 · Apr 9, 2018 · Updated 8 years ago
cohere-samples / cohere-slack-starter-app
Co:here-powered Slack App Starter Project
☆13 · Apr 1, 2022 · Updated 4 years ago
gregdeon / spotlight
Implementation of the spotlight: a method for discovering systematic errors in deep learning models
☆11 · Oct 5, 2021 · Updated 4 years ago
cliang1453 / task-aware-distillation
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)
☆40 · Aug 28, 2023 · Updated 2 years ago
ictnlp / NA-MNMT
Source code for "Importance-based Neuron Allocation for Multilingual Neural Machine Translation"
☆12 · Sep 15, 2021 · Updated 4 years ago
yunan4nlp / NNDisParser
☆10 · Aug 30, 2022 · Updated 3 years ago
Chrisa142857 / You-Only-Look-Cytopathology-Once
Codes available of a paper: An Efficient Cervical Whole Slide Image Analysis Framework Based on Multi-scale Semantic and Location Deep Fe…
☆16 · Jul 26, 2022 · Updated 4 years ago
delta2323 / GB-GNN
Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks
☆13 · Jun 16, 2020 · Updated 6 years ago
AI0Research / MRDL-and-MRDR
☆10 · Apr 5, 2025 · Updated last year
ygjin11 / task-hypernet
The official implementation of the paper "Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork".
☆12 · Feb 27, 2024 · Updated 2 years ago
malihealikhani / CITE
CITE: A Corpus of Image-Text Discourse Relations
☆13 · Apr 7, 2019 · Updated 7 years ago
SimiaoZuo / MoEBERT
This PyTorch package implements MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation (NAACL 2022).
☆114 · May 2, 2022 · Updated 4 years ago
yilinyang7 / fairseq_multi_fix
Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…
☆13 · Aug 12, 2024 · Updated last year
bertsky / ocrd_publaynet
convert PubLayNet data into METS/PAGE-XML
☆10 · Mar 17, 2020 · Updated 6 years ago
yueyu1030 / COSINE
[NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…
☆205 · Aug 17, 2022 · Updated 3 years ago