LanD-FBK / prodigy-dataset
PRODIGy is a collection of dialogues in which each conversation is aligned with speaker profile representations.
☆19 · Updated 11 months ago
Alternatives and similar repositories for prodigy-dataset
Users who are interested in prodigy-dataset are comparing it to the repositories listed below.
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023) ☆64 · Updated 2 years ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆156 · Updated 2 years ago
- On Transferability of Prompt Tuning for Natural Language Processing ☆100 · Updated last year
- Data and code for the paper "Inducing Positive Perspectives with Text Reframing" ☆61 · Updated 2 years ago
- Reverse Instructions to generate instruction tuning data with corpus examples ☆216 · Updated last year
- ☆34 · Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆117 · Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆135 · Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆52 · Updated 4 months ago
- ☆68 · Updated 2 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆86 · Updated last year
- ☆75 · Updated last year
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind ☆66 · Updated last year
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea… ☆76 · Updated last year
- ☆159 · Updated last year
- ☆53 · Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆165 · Updated 2 years ago
- Unofficial implementation of AlpaGasus ☆93 · Updated 2 years ago
- The model implementations for T5 encoder decoder soft prompt tuning for text generation. ☆25 · Updated 3 years ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs. ☆89 · Updated last year
- The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat data… ☆105 · Updated last year
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked… ☆168 · Updated last year
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection ☆90 · Updated last year
- ☆47 · Updated 2 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆95 · Updated last year
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs ☆48 · Updated last year
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)" ☆53 · Updated last year
- RARR: Researching and Revising What Language Models Say, Using Language Models ☆49 · Updated 2 years ago
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation ☆28 · Updated last year
- Token-level Reference-free Hallucination Detection ☆97 · Updated 2 years ago