yangheng95 / InstOptima
This repo is for our EMNLP 2023 short paper (Findings): InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators.
☆11 Updated 10 months ago
Related projects
Alternatives and complementary repositories for InstOptima
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives". ☆13 Updated last week
- ☆41 Updated last year
- Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models" ☆55 Updated last year
- ☆31 Updated last year
- [IJCAI 2023] Black-box Prompt Tuning for Vision-Language Model as a Service ☆15 Updated last year
- Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning" ☆32 Updated 7 months ago
- Code for the paper "Policy Optimization in RLHF: The Impact of Out-of-preference Data" ☆23 Updated 10 months ago
- ☆23 Updated 5 months ago
- Code and data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆10 Updated 4 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data. ☆9 Updated 8 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆29 Updated 2 weeks ago
- GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems ☆10 Updated 2 years ago
- Lightweight Adapting for Black-Box Large Language Models ☆18 Updated 8 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models ☆29 Updated 8 months ago
- Rewarded soups official implementation ☆50 Updated last year
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment" ☆47 Updated last week
- Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ☆13 Updated 3 weeks ago
- EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023) ☆20 Updated last year
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs ☆31 Updated 9 months ago
- Uncertainty quantification for in-context learning of large language models ☆12 Updated 7 months ago
- [ACL2024] A Codebase for Incremental Learning with Large Language Models; Official released code for "Learn or Recall? Revisiting Increme… ☆20 Updated 3 weeks ago
- Domain-specific preference (DSP) data and customized RM fine-tuning. ☆24 Updated 8 months ago
- ☆19 Updated last month
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning ☆22 Updated 3 months ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs" (under preparation) ☆13 Updated this week
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA ☆21 Updated 7 months ago
- A list of awesome papers and resources at the intersection of Large Language Models and Evolutionary Computation. ☆34 Updated 5 months ago
- Learning adapter weights from task descriptions ☆15 Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆52 Updated 2 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆49 Updated 2 weeks ago