michaelrzhang / LLM-HyperOpt
Using Large Language Models for Hyperparameter Optimization
☆14Updated 10 months ago
Alternatives and similar repositories for LLM-HyperOpt:
Users who are interested in LLM-HyperOpt are comparing it to the libraries listed below.
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 6 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆28Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆39Updated 3 months ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆19Updated 7 months ago
- Lottery Ticket Adaptation☆39Updated 4 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆29Updated 8 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆22Updated 3 weeks ago
- Code for T-MARS data filtering☆35Updated last year
- The code of arXiv paper: "Dynamic Scaling of Unit Tests for Code Reward Modeling"☆17Updated 2 months ago
- Recycling diverse models☆44Updated 2 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 9 months ago
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆15Updated last week
- Self-Supervised Alignment with Mutual Information☆16Updated 10 months ago
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆21Updated last year
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆38Updated 2 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆70Updated 4 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence☆9Updated 5 months ago
- [IJCAI 2023] Black-box Prompt Tuning for Vision-Language Model as a Service☆17Updated last year