cambridgeltl / ClaPS
Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning (Zhou et al.; EMNLP 2023 Findings)
☆16Updated 7 months ago
Related projects: ⓘ
- Codebase for Inference-Time Policy Adapters☆19Updated 10 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆46Updated 5 months ago
- Code for paper - On Diversified Preferences of Large Language Model Alignment☆14Updated last month
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆48Updated 6 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆22Updated last month
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆39Updated last year
- ☆80Updated 9 months ago
- Restore safety in fine-tuned language models through task arithmetic☆25Updated 5 months ago
- ☆29Updated 10 months ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆9Updated 10 months ago
- ☆44Updated 2 weeks ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆38Updated 10 months ago
- Knowledge Circuits in Pretrained Transformers☆46Updated this week
- ☆39Updated 9 months ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆22Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆37Updated last year
- ☆22Updated 2 months ago
- Parsimonious Concept Engineering (PaCE) uses sparse coding on a large-scale concept dictionary to effectively improve the trustworthiness…☆25Updated 3 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆49Updated 2 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆51Updated last year
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆34Updated last year
- ☆32Updated 10 months ago
- Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c…☆18Updated last year
- ☆69Updated 10 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆45Updated 3 months ago
- ☆25Updated 3 months ago
- Lightweight Adapting for Black-Box Large Language Models☆16Updated 7 months ago
- Model Editing Can Hurt General Abilities of Large Language Models☆29Updated 7 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆62Updated 3 months ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆14Updated 9 months ago