yangheng95 / InstOptima
This repo is for our EMNLP 2023 short paper (Findings): InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators.
☆12Updated last year
Alternatives and similar repositories for InstOptima
Users who are interested in InstOptima are comparing it to the libraries listed below.
- ☆11Updated 8 months ago
- The official implementation of the paper "Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork".☆12Updated last year
- ☆16Updated 10 months ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 10 months ago
- [NeurIPS 2020 Spotlight Oral] "Training Stronger Baselines for Learning to Optimize", Tianlong Chen*, Weiyi Zhang*, Jingyang Zhou, Shiyu …☆28Updated 3 years ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆45Updated last year
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆32Updated last year
- ☆46Updated 2 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆28Updated 2 years ago
- Rewarded soups official implementation☆60Updated last year
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆37Updated last year
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆25Updated 2 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆12Updated 3 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Updated 2 years ago
- ☆17Updated 4 years ago
- Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"☆41Updated last year
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Updated last year
- GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems☆10Updated 3 years ago
- Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"☆56Updated 2 years ago
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆78Updated 4 months ago
- Codes for Evolving Plastic ANNs☆15Updated 2 years ago
- The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".☆13Updated 3 years ago
- Official implementation of "Improvable Gap Balancing for Multi-Task Learning".☆16Updated 2 years ago
- TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning☆10Updated 2 years ago
- [ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models☆22Updated 6 months ago
- ☆31Updated 2 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆36Updated last year
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Updated 4 years ago
- Evolutionary-Algorithm and Large-Language-Model☆19Updated 10 months ago