Jxu-Thu / DITTO
The code of paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" published at NeurIPS 2022
☆44Updated 2 years ago
Alternatives and similar repositories for DITTO:
Users that are interested in DITTO are comparing it to the libraries listed below
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆22Updated 6 months ago
- ☆43Updated 3 years ago
- ☆95Updated 4 months ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.☆51Updated 2 years ago
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆99Updated last year
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆68Updated 6 months ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆96Updated 2 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆55Updated 7 months ago
- ☆85Updated 2 years ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"☆28Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆64Updated this week
- ☆47Updated 10 months ago
- ☆97Updated 2 years ago
- LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets☆35Updated 4 months ago
- ☆52Updated 6 months ago
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T…☆30Updated 2 years ago
- Code base of In-Context Learning for Dialogue State tracking☆45Updated last year
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 6 months ago
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)☆24Updated 3 years ago
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"☆34Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆129Updated last year
- ConTextual Mask Auto-Encoder for Dense Passage Retrieval☆35Updated 3 months ago
- Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation☆43Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆62Updated last year
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆72Updated 8 months ago
- code for Teaching LM to Translate with Comparison☆39Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆30Updated last year
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Updated 2 years ago
- ACL'23: Unified Demonstration Retriever for In-Context Learning☆36Updated last year
- ☆21Updated last year