txsun1997 / Black-Box-TuningLinks
ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Models
☆269Updated 2 years ago
Alternatives and similar repositories for Black-Box-Tuning
Users that are interested in Black-Box-Tuning are comparing it to the libraries listed below
Sorting:
- ☆177Updated last year
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆197Updated 2 years ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆132Updated 2 years ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆270Updated 2 years ago
- ☆63Updated 2 years ago
- 🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT☆195Updated 2 years ago
- ☆348Updated 4 years ago
- ACL'23: Unified Demonstration Retriever for In-Context Learning☆38Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆139Updated 3 months ago
- Released code for our ICLR23 paper.☆65Updated 2 years ago
- contrastive decoding☆203Updated 2 years ago
- ☆279Updated 7 months ago
- Must-read papers on improving efficiency for pre-trained language models.☆105Updated 2 years ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆167Updated 3 years ago
- [ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners☆132Updated 2 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆154Updated 2 years ago
- Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.☆286Updated 2 years ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆102Updated 2 years ago
- Paper collections of retrieval-based (augmented) language model.☆234Updated last year
- MEND: Fast Model Editing at Scale☆249Updated last year
- ☆53Updated last year
- ☆75Updated last year
- ☆87Updated 2 years ago
- Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"☆171Updated last year
- Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》☆61Updated 3 years ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆23Updated last year
- Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets☆341Updated last year
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆197Updated 2 years ago
- Do Large Language Models Know What They Don’t Know?☆99Updated 9 months ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆114Updated 2 years ago