LanD-FBK / prodigy-datasetLinks
PRODIGy is a collection of dialogues in which each conversation is aligned with speaker profile representations.
☆19Updated 7 months ago
Alternatives and similar repositories for prodigy-dataset
Users that are interested in prodigy-dataset are comparing it to the libraries listed below
Sorting:
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆113Updated last year
- ☆73Updated last year
- On Transferability of Prompt Tuning for Natural Language Processing☆99Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆153Updated last year
- ☆53Updated last year
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆64Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆108Updated 2 months ago
- The model implementations for T5 encoder decoder soft prompt tuning for text generation.☆24Updated 2 years ago
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection☆88Updated last year
- ☆154Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- ☆71Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆146Updated 9 months ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆74Updated last year
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆83Updated last year
- Benchmarking LLMs' Emotional Alignment with Humans☆107Updated 6 months ago
- ☆125Updated 10 months ago
- Unofficial implementation of AlpaGasus☆92Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75Updated 2 months ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆96Updated last year
- ☆140Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆122Updated 8 months ago
- ☆49Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆49Updated 8 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 5 months ago
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"☆130Updated 3 weeks ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated 2 years ago