google-research-datasets / Synthetic-Persona-ChatLinks
The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat dataset.
β99Updated last year
Alternatives and similar repositories for Synthetic-Persona-Chat
Users that are interested in Synthetic-Persona-Chat are comparing it to the libraries listed below
Sorting:
- NAACL 2024. Code & Dataset for "π Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakeβ¦β43Updated last year
- β96Updated last year
- β52Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"β86Updated last year
- A set of utilities for running few-shot prompting experiments on large-language modelsβ122Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.β153Updated last year
- β57Updated 11 months ago
- β127Updated 11 months ago
- [NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"β113Updated 2 years ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"β116Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.β116Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersβ133Updated last year
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generationβ85Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".β111Updated 3 months ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.β105Updated last year
- Codebase accompanying the Summary of a Haystack paper.β79Updated last year
- β74Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agentsβ109Updated 10 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Fβ¦β67Updated last year
- β52Updated 2 years ago
- Reverse Instructions to generate instruction tuning data with corpus examplesβ215Updated last year
- PRODIGy is a collection of dialogues in which each conversation is aligned with speaker profile representations.β19Updated 8 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?β161Updated last year
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paperβ¦β124Updated 11 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β78Updated 11 months ago
- Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)β162Updated last year
- β68Updated 2 years ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedbackβ207Updated 2 years ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)β143Updated 10 months ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"β96Updated last year