Moocember / Optimization-by-PROmpting
☆75Updated 11 months ago
Related projects: ⓘ
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆55Updated last week
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆84Updated 11 months ago
- Chain-of-Hindsight, A Scalable RLHF Method☆213Updated 11 months ago
- ☆118Updated 5 months ago
- ☆105Updated this week
- ☆111Updated 3 months ago
- Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"☆108Updated 4 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆81Updated last month
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆134Updated 6 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆182Updated last month
- ☆87Updated 2 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆89Updated 4 months ago
- Self-Alignment with Principle-Following Reward Models☆144Updated 6 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆48Updated 3 weeks ago
- augmented LLM with self reflection☆80Updated 9 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆133Updated 10 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆123Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆65Updated 2 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆151Updated 4 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆131Updated 2 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆82Updated 2 months ago
- ☆79Updated 3 months ago
- Plug in and play implementation of " Textbooks Are All You Need", ready for training, inference, and dataset generation☆75Updated last year
- ☆50Updated 2 months ago
- Official implementation for the paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention M…☆95Updated last month
- ☆87Updated 3 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆96Updated last week
- ☆82Updated 3 weeks ago
- An Analytical Evaluation Board of Multi-turn LLM Agents☆227Updated 4 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆107Updated 2 weeks ago