Moocember / Optimization-by-PROmpting
☆78Updated last year
Alternatives and similar repositories for Optimization-by-PROmpting
Users who are interested in Optimization-by-PROmpting are comparing it to the repositories listed below
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated 9 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆96Updated last year
- ☆120Updated 7 months ago
- ☆121Updated 11 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated last year
- Self-Alignment with Principle-Following Reward Models☆161Updated last week
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆111Updated last year
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆56Updated 11 months ago
- augmented LLM with self reflection☆121Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- ☆44Updated 11 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Updated 7 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆85Updated last month
- Benchmarking LLMs with Challenging Tasks from Real Users☆222Updated 6 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆159Updated last year
- Simple next-token-prediction for RLHF☆225Updated last year
- Open Implementations of LLM Analyses☆103Updated 7 months ago
- Reformatted Alignment☆114Updated 7 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆150Updated last year
- ☆147Updated last year
- Plug-and-play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation☆76Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- ☆129Updated 6 months ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆95Updated last year
- ☆181Updated 3 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆108Updated last week
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆135Updated 6 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- Critique-out-Loud Reward Models☆64Updated 6 months ago