ChandlerGuan/Transkimmer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ChandlerGuan/Transkimmer)

ChandlerGuan / Transkimmer

Code for ACL2022 publication Transkimmer: Transformer Learns to Layer-wise Skim

☆22

Alternatives and similar repositories for Transkimmer

Users that are interested in Transkimmer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uchuhimo / amanda
View on GitHub
☆18Apr 21, 2024Updated 2 years ago
amodaresi / AdapLeR
View on GitHub
☆21Nov 26, 2022Updated 3 years ago
mlpen / LookupFFN
View on GitHub
☆21Mar 7, 2024Updated 2 years ago
SJTU-ReArch-Group / Paper-Reading-List
View on GitHub
☆154Updated this week
kwantam / fffft
View on GitHub
fft impl for ff::Field
☆17May 9, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
illinois-impact / klap
View on GitHub
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15Jun 21, 2019Updated 7 years ago
clovaai / length-adaptive-transformer
View on GitHub
Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)
☆102Nov 2, 2020Updated 5 years ago
SJTU-IPADS / fgnn-artifacts
View on GitHub
FGNN's artifact evaluation (EuroSys 2022)
☆18Apr 25, 2022Updated 4 years ago
YouAreSpecialToMe / QST
View on GitHub
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models
☆49Nov 5, 2024Updated last year
usyd-fsalab / ReadingList
View on GitHub
☆13Apr 27, 2022Updated 4 years ago
clevercool / ANT-Quantization
View on GitHub
☆123Nov 17, 2023Updated 2 years ago
IBM / PoWER-BERT
View on GitHub
Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…
☆63Sep 17, 2025Updated 10 months ago
yashbonde / RNN-sim
View on GitHub
Running massive simulations using RNNs on CPUs for building bots and all kinds of things.
☆12Jun 13, 2021Updated 5 years ago
mlpc-ucsd / BERT_Convolutions
View on GitHub
(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.
☆21Jul 13, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
DRSY / EasyKV
View on GitHub
Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)
☆62Feb 13, 2024Updated 2 years ago
jiazhihao / attention_superoptimizer
View on GitHub
An Attention Superoptimizer
☆22Jan 20, 2025Updated last year
mojsaeed / RuleBert
View on GitHub
☆20Mar 30, 2022Updated 4 years ago
ptlmasking / maskbert
View on GitHub
☆20Dec 16, 2020Updated 5 years ago
cmd2001 / jLock
View on GitHub
Python Script to Open SJTU Dormitory Smart Lock
☆10Sep 12, 2022Updated 3 years ago
facebookresearch / task_bench
View on GitHub
The TaskBench500 dataset and code for generating tasks.
☆16Jul 16, 2022Updated 4 years ago
yeachan-kr / c2a
View on GitHub
Pytorch implementations of Client-Customized Adaptation for Parameter-Efficient Federated Learning (Findings of ACL: ACL 2023)
☆17Oct 9, 2023Updated 2 years ago
Liuhong99 / implicitbiasmlmcode
View on GitHub
☆13Mar 22, 2023Updated 3 years ago
llyx97 / Rosita
View on GitHub
[AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan
☆14Oct 18, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
fuzihaofzh / AnalyzeParameterEfficientFinetune
View on GitHub
On the Effectiveness of Parameter-Efficient Fine-Tuning
☆39Nov 4, 2023Updated 2 years ago
alibaba / SimCSE-with-CARDS
View on GitHub
Source code for SIGIR 2022 paper.
☆16Apr 25, 2022Updated 4 years ago
GPUPeople / GPUMemManSurvey
View on GitHub
Evaluating different memory managers for dynamic GPU memory
☆26Dec 16, 2020Updated 5 years ago
minhtannguyen / FourierFormer_NeurIPS
View on GitHub
☆13Oct 15, 2022Updated 3 years ago
thunlp / MoEfication
View on GitHub
☆146Jul 21, 2024Updated 2 years ago
Klitter / A-Bayesian-Federated-Learning-Framework-with-Online-Laplace-Approximation
View on GitHub
☆10Jul 21, 2021Updated 5 years ago
nanfangAlan / FSRFER
View on GitHub
a TensorFlow implementation of the paper "Feature Super-Resolution Based Facial Expression Recognition for Multi-scale Low-Resolution Ima…
☆13Nov 30, 2021Updated 4 years ago
cdaymand / slaythecli
View on GitHub
SlayTheCli: A console client for the game Slay The Spire
☆17Jul 12, 2020Updated 6 years ago
IlanPrice / DCTpS
View on GitHub
Code for testing DCT plus Sparse (DCTpS) networks
☆14Jun 15, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kuleshov-group / MODULoRA-Experiment
View on GitHub
Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…
☆13Dec 5, 2023Updated 2 years ago
activatedgeek / tight-pac-bayes
View on GitHub
Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022
☆18Nov 23, 2022Updated 3 years ago
raymin0223 / fast_robust_early_exit
View on GitHub
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
☆67Sep 28, 2024Updated last year
Engineev / mocker
View on GitHub
A compiler for course Compiler 2019
☆16Jan 9, 2020Updated 6 years ago
TonyTangYu / pytorch
View on GitHub
DELTA-pytorch：DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation
☆12Apr 16, 2024Updated 2 years ago
HanGuo97 / AutoSeM
View on GitHub
Code and Models for paper "AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning. Han Guo, Ramakanth Pasunuru, and Mohit Ba…
☆24Apr 15, 2019Updated 7 years ago
Raphael-Hao / brainstorm
View on GitHub
Compiler for Dynamic Neural Networks
☆45Nov 13, 2023Updated 2 years ago