Code for ACL2022 publication Transkimmer: Transformer Learns to Layer-wise Skim
☆22Aug 21, 2022Updated 3 years ago
Alternatives and similar repositories for Transkimmer
Users that are interested in Transkimmer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Mar 7, 2024Updated 2 years ago
- fft impl for ff::Field☆17May 9, 2024Updated last year
- ☆145Dec 19, 2025Updated 3 months ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- FGNN's artifact evaluation (EuroSys 2022)☆18Apr 25, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…☆62Sep 17, 2025Updated 6 months ago
- Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models☆49Nov 5, 2024Updated last year
- ☆20Mar 30, 2022Updated 3 years ago
- Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)☆62Feb 13, 2024Updated 2 years ago
- Implementation for Variational Information Bottleneck for Effective Low-resource Fine-tuning, ICLR 2021☆43May 10, 2021Updated 4 years ago
- On the Effectiveness of Parameter-Efficient Fine-Tuning☆38Nov 4, 2023Updated 2 years ago
- [AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan☆14Oct 18, 2022Updated 3 years ago
- Pytorch implementations of Client-Customized Adaptation for Parameter-Efficient Federated Learning (Findings of ACL: ACL 2023)☆17Oct 9, 2023Updated 2 years ago
- Source code for SIGIR 2022 paper.☆16Apr 25, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Evaluating different memory managers for dynamic GPU memory☆26Dec 16, 2020Updated 5 years ago
- 使用多头注意力机制实现数字预测☆10May 10, 2022Updated 3 years ago
- TIFMO: Textual Inference Forward-chaining MOdule☆12Apr 25, 2014Updated 11 years ago
- ☆13Oct 15, 2022Updated 3 years ago
- ☆10Jul 21, 2021Updated 4 years ago
- open source taxi dispatch software 出行加打车软件UI设计效果图☆14Dec 22, 2020Updated 5 years ago
- ☆11Aug 26, 2021Updated 4 years ago
- a TensorFlow implementation of the paper "Feature Super-Resolution Based Facial Expression Recognition for Multi-scale Low-Resolution Ima…☆13Nov 30, 2021Updated 4 years ago
- ☆28Aug 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Dec 5, 2023Updated 2 years ago
- Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022☆19Nov 23, 2022Updated 3 years ago
- ☆15Feb 8, 2023Updated 3 years ago
- Video handwritten digit recognition based on k-NN algorithm 基于k-NN算法的视频手写数字识别☆15Feb 2, 2021Updated 5 years ago
- CR-LT KGQA Dataset Repository☆10Jun 1, 2025Updated 9 months ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48May 25, 2022Updated 3 years ago
- ShadowBound: Efficient Memory Protection through Advanced Metadata Management and Customized Compiler Optimization (USENIX Security 2024)…☆27Jul 31, 2024Updated last year
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- Compiler for Dynamic Neural Networks☆45Nov 13, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.☆14Jul 9, 2023Updated 2 years ago
- decontamination☆27Mar 4, 2026Updated 3 weeks ago
- GPU operators for sparse tensor operations☆35Mar 11, 2024Updated 2 years ago
- ☆14Aug 18, 2025Updated 7 months ago
- ☆12Sep 25, 2018Updated 7 years ago
- ☆12Apr 30, 2024Updated last year
- ☆17Nov 10, 2021Updated 4 years ago