☆21 · Jul 5, 2024 · Updated last year
Alternatives and similar repositories for peft
Users that are interested in peft are comparing it to the libraries listed below.
- Scaling Sparse Fine-Tuning to Large Language Models ☆19 · Jan 31, 2024 · Updated 2 years ago
- The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte… ☆17 · Jan 15, 2024 · Updated 2 years ago
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti… ☆11 · Nov 28, 2023 · Updated 2 years ago
- [Neurips 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration ☆31 · Feb 11, 2023 · Updated 3 years ago
- ☆11 · Nov 13, 2024 · Updated last year
- ☆42 · Oct 31, 2024 · Updated last year
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De… ☆45 · Nov 11, 2023 · Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- This is the official code for UGTs. ☆13 · Feb 8, 2023 · Updated 3 years ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs ☆13 · Dec 28, 2024 · Updated last year
- Code for testing DCT plus Sparse (DCTpS) networks ☆14 · Jun 15, 2021 · Updated 5 years ago
- Residual Prompt Tuning: a method for faster and better prompt tuning. ☆56 · May 10, 2023 · Updated 3 years ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency ☆25 · Jul 31, 2024 · Updated last year
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So… ☆16 · Apr 21, 2025 · Updated last year
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, … ☆18 · Dec 30, 2021 · Updated 4 years ago
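- Color calibration files ☆11 · Aug 27, 2020 · Updated 5 years ago
- Latest database of China's provinces, cities, and districts, including names, city codes, parent codes, levels, and postal codes. ☆16 · May 31, 2017 · Updated 9 years ago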
- PyTorch implementation of Language model compression with weighted low-rank factorization ☆14 · Jun 28, 2023 · Updated 2 years ago
- implementation of https://arxiv.org/pdf/2312.09299 ☆21 · Jul 3, 2024 · Updated last year
- ☆10 · Mar 2, 2024 · Updated 2 years ago
- Learning adapter weights from task descriptions ☆19 · Nov 12, 2023 · Updated 2 years ago
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling) ☆20 · May 11, 2019 · Updated 7 years ago
- Re-implementation of Exploiting Edge Features in Graph Neural Networks ☆11 · Apr 7, 2022 · Updated 4 years ago
- ☆15 · Oct 31, 2023 · Updated 2 years ago
- Fish4Knowledge dataset cleaning, UOE 4th Year Honours Project. ☆11 · Jun 13, 2018 · Updated 8 years ago
- ☆16 · Sep 27, 2023 · Updated 2 years ago
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe… ☆15 · Oct 18, 2022 · Updated 3 years ago
- Change-point detection using neural networks ☆22 · Dec 6, 2023 · Updated 2 years ago
- ☆15 · May 26, 2026 · Updated 3 weeks ago
- Multicultural Proverbs and Sayings ☆13 · Jan 11, 2025 · Updated last year
- 🔥 🔥 [WACV2024] Mini but Mighty: Finetuning ViTs with Mini Adapters ☆20 · Jul 5, 2024 · Updated last year
- FuseAI Project ☆93 · Jan 25, 2025 · Updated last year
- Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging" ☆12 · Oct 14, 2025 · Updated 8 months ago
- ☆26 · Nov 23, 2023 · Updated 2 years ago
- An implementation of run-length encoding for PyTorch tensors using CUDA ☆14 · Apr 7, 2021 · Updated 5 years ago
- ☆22 · Jun 11, 2024 · Updated 2 years ago
- Compare the theoretical attention gradient with the PyTorch attention gradient ☆16 · Apr 1, 2024 · Updated 2 years ago
- interact with your robot in JS, inspired by LeRobot ☆38 · Nov 14, 2025 · Updated 7 months ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections ☆22 · Oct 15, 2024 · Updated last year