Moocember / Optimization-by-PROmpting
☆78Updated last year
Alternatives and similar repositories for Optimization-by-PROmpting
Users who are interested in Optimization-by-PROmpting are comparing it to the libraries listed below
- ☆150Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆97Updated last year
- ☆127Updated 11 months ago
- ☆122Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆72Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆62Updated last year
- ☆29Updated last month
- ☆183Updated 7 months ago
- Open Implementations of LLM Analyses☆106Updated 11 months ago
- ☆100Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated 2 years ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆118Updated last year
- augmented LLM with self reflection☆131Updated last year
- ☆135Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆115Updated 2 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆122Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆163Updated last year
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts…☆196Updated last year
- Plug-and-play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation☆74Updated last year
- Simple next-token-prediction for RLHF☆227Updated last year
- ☆35Updated 2 years ago
- This is the official repository for Inheritune.☆113Updated 7 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated 2 years ago
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- Self-Alignment with Principle-Following Reward Models☆165Updated 4 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆50Updated 10 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆160Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 11 months ago