iwangjian / Midi-Tuning
Code and data for "Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue" (ACL 2024)
☆22Updated 5 months ago
Alternatives and similar repositories for Midi-Tuning:
Users that are interested in Midi-Tuning are comparing it to the libraries listed below
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach☆30Updated last year
- Code and data for "Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation" (EMNLP 2023…☆30Updated 8 months ago
- Code and data for "Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue" (ACM TOIS)☆10Updated 3 months ago
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆13Updated 6 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆76Updated 11 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆44Updated 3 weeks ago
- Code and data for "Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue" (ACL Findings 2023).☆22Updated last year
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆35Updated 3 months ago
- ☆39Updated 2 months ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆21Updated 3 months ago
- This the implementation of LeCo☆30Updated 6 months ago
- Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆22Updated 5 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆19Updated 3 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆39Updated 10 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆33Updated last year
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆64Updated 2 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆71Updated 7 months ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆15Updated last year
- GPT as Human☆18Updated last month
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆53Updated 5 months ago
- The code and data for the paper JiuZhang3.0☆40Updated 7 months ago
- ☆26Updated 3 weeks ago
- ☆56Updated 4 months ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Updated 7 months ago
- code for Preprint paper at Arxiv: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts☆18Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆55Updated 6 months ago
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts☆17Updated 4 months ago
- Released code for our ICLR23 paper.☆63Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆62Updated 10 months ago
- ☆57Updated 7 months ago