LanD-FBK / prodigy-dataset
PRODIGy is a collection of dialogues in which each conversation is aligned with speaker profile representations.
☆19 Updated 6 months ago
Alternatives and similar repositories for prodigy-dataset
Users who are interested in prodigy-dataset are comparing it to the repositories listed below.
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆150 Updated last year
- ☆72 Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆90 Updated 8 months ago
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023) ☆64 Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023 ☆35 Updated last year
- ☆43 Updated last year
- On Transferability of Prompt Tuning for Natural Language Processing ☆99 Updated last year
- Unofficial implementation of AlpaGasus ☆92 Updated last year
- Multilingual Large Language Models Evaluation Benchmark ☆127 Updated 10 months ago
- Benchmarking LLMs' Emotional Alignment with Humans ☆105 Updated 5 months ago
- ☆31 Updated 8 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆86 Updated 11 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆131 Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs. ☆85 Updated last year
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection ☆88 Updated last year
- ☆52 Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples ☆214 Updated last year
- The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat data… ☆96 Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆74 Updated 2 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆48 Updated 7 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆114 Updated last year
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver… ☆19 Updated 8 months ago
- Code and data for the FACTOR paper ☆49 Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆113 Updated last year
- Codebase for LLM story generation; updated version of https://github.com/yangkevin2/doc-story-generation ☆83 Updated last year
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023) ☆64 Updated last year
- ☆71 Updated 7 months ago
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked… ☆163 Updated last year
- ☆48 Updated last year
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation" ☆38 Updated last year