uclaml / Rephrase-and-Respond
Official repo of Respond-and-Respond: data, code, and evaluation
☆104Updated 7 months ago
Alternatives and similar repositories for Rephrase-and-Respond:
Users that are interested in Rephrase-and-Respond are comparing it to the libraries listed below
- Codebase accompanying the Summary of a Haystack paper.☆76Updated 6 months ago
- ☆142Updated 11 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆109Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆88Updated last month
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆95Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆105Updated 6 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆61Updated last year
- ☆120Updated 9 months ago
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆102Updated 5 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆102Updated 3 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆131Updated 4 months ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆27Updated last month
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆75Updated last year
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts…☆193Updated 8 months ago
- Evaluating LLMs with fewer examples☆148Updated 11 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆219Updated 4 months ago
- This is the official repository for Inheritune.☆111Updated last month
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆82Updated last year
- ☆150Updated last year
- ☆119Updated 6 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- ☆51Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 5 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- ☆36Updated 2 years ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆151Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆52Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM☆68Updated 6 months ago