psunlpgroup / GreaTer
Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers
☆28 Updated last month
Alternatives and similar repositories for GreaTer
Users who are interested in GreaTer are comparing it to the repositories listed below.
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆47 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 9 months ago
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 4 months ago
- ☆20 Updated 7 months ago
- Exploration of automated dataset selection approaches at large scales. ☆42 Updated 3 months ago
- A repository for research on medium sized language models. ☆76 Updated last year
- ☆65 Updated 2 months ago
- Codebase for Instruction Following without Instruction Tuning ☆34 Updated 8 months ago
- ☆23 Updated 2 months ago
- [ICLR 2025] ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning ☆53 Updated 2 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Updated 7 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆39 Updated last month
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF ☆18 Updated 7 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆42 Updated last week
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆25 Updated 2 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark. ☆16 Updated 2 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆50 Updated 5 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated last year
- A testbed for agents and environments that can automatically improve models through data generation. ☆24 Updated 3 months ago
- Verifiers for LLM Reinforcement Learning ☆56 Updated last month
- [NAACL 2025] Representing Rule-based Chatbots with Transformers ☆21 Updated 3 months ago
- ☆28 Updated last year
- Tree prompting: easy-to-use scikit-learn interface for improved prompting. ☆37 Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 4 months ago
- ☆17 Updated last month
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024) ☆36 Updated 5 months ago
- ☆51 Updated last year
- The repository contains code for Adaptive Data Optimization ☆24 Updated 5 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆47 Updated 6 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory ☆62 Updated last week