michaelrzhang / LLM-HyperOptLinks

Using Large Language Models for Hyperparameter Optimization

☆15

Alternatives and similar repositories for LLM-HyperOpt

Users that are interested in LLM-HyperOpt are comparing it to the libraries listed below

Sorting:

codezakh / DataEnvGym
A testbed for agents and environments that can automatically improve models through data generation.
☆24Updated 3 months ago
katiekang1998 / reasoning_generalization
☆32Updated 5 months ago
abaheti95 / LoL-RL
Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients
☆26Updated 8 months ago
Gabesarch / ICAL
☆40Updated 3 weeks ago
cassidylaidlaw / orpo
☆15Updated 6 months ago
mandyyyyii / east
☆18Updated last month
facebookresearch / dualformer
implementation of dualformer
☆17Updated 3 months ago
McGill-NLP / agent-reward-bench
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
☆15Updated 3 weeks ago
holarissun / RewardModelingBeyondBradleyTerry
official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…
☆58Updated 2 months ago
ZhaolinGao / REFUEL
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
☆18Updated 8 months ago
WEIRDLabUW / vpl_llm
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆18Updated 9 months ago
tianjunz / TEMPERA
☆44Updated 2 years ago
HenryLau7 / CFPO
☆20Updated 4 months ago
facebookresearch / ModelRatatouille
Recycling diverse models
☆44Updated 2 years ago
kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆39Updated 6 months ago
hartvigsen-group / composable-interventions
☆28Updated 3 months ago
ablghtianyi / ICL_Modular_Arithmetic
☆19Updated 2 months ago
Gen-Verse / CURE
Open-Source LLM Coders with Co-Evolving Reinforcement Learning
☆40Updated this week
ZhentingWang / DUMP
☆19Updated last month
shenao-zhang / SELM
The official implementation of Self-Exploring Language Models (SELM)
☆64Updated last year
kokolerk / TON
Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
☆36Updated 2 weeks ago
limenlp / safer-instruct
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Updated last year
wang-kee / LiNeS
Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"
☆26Updated 7 months ago
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆34Updated last year
LAMDASZ-ML / Self-Backtracking
☆45Updated 3 months ago
lapisrocks / DiscreteAdversarialDistillation
[NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"
☆12Updated 11 months ago
hengzzzhou / ReSo
☆13Updated 2 months ago
cmu-l3 / neurips2024-inference-tutorial-code
NeurIPS 2024 tutorial on LLM Inference
☆45Updated 5 months ago
open-compass / ProSA
[EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
☆27Updated 2 weeks ago
microsoft / tale-suite
Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.
☆14Updated last week