☆53Apr 29, 2020Updated 5 years ago
Alternatives and similar repositories for loss_dropper
Users that are interested in loss_dropper are comparing it to the libraries listed below
Sorting:
- Posterior Control of Blackbox Generation☆23May 2, 2020Updated 5 years ago
- This repository contains the script to compute the questions based on the Answerability aspect.☆38Nov 12, 2019Updated 6 years ago
- (AAAI'20) The source code for the paper "Controlling the Amount of Verbatim Copying in Abstractive Summarization".☆38Oct 14, 2020Updated 5 years ago
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- ☆42Jan 11, 2021Updated 5 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 2 years ago
- ☆13Sep 27, 2022Updated 3 years ago
- Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1☆14Mar 27, 2024Updated last year
- ☆12Feb 18, 2020Updated 6 years ago
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated last year
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆17May 21, 2025Updated 9 months ago
- Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).☆61Feb 7, 2022Updated 4 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Mar 17, 2020Updated 5 years ago
- ☆12Jan 2, 2022Updated 4 years ago
- ☆44Jul 29, 2019Updated 6 years ago
- An example application of neural network distillation to MNIST☆11Sep 29, 2016Updated 9 years ago
- ☆50Feb 5, 2023Updated 3 years ago
- Video classification, youtube8m, Knowledge distillation, Tensorflow, NeXtVLAD☆27Sep 5, 2019Updated 6 years ago
- ☆27Jul 29, 2023Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆211Nov 20, 2023Updated 2 years ago
- NAACL 2021 - Progressive Generation of Long Text☆82Oct 2, 2020Updated 5 years ago
- Paraphrasing for academic texts☆14Dec 8, 2022Updated 3 years ago
- Masking tokens to modify the predictions of a pretrained sentence classifier☆16Feb 4, 2020Updated 6 years ago
- ☆17Aug 13, 2024Updated last year
- English or Chinses GPT2Dialog model from GPT2-chitchat☆12Feb 23, 2020Updated 6 years ago
- ☆13Apr 8, 2019Updated 6 years ago
- Scripts to create the MLB dataset introduced in the paper Data-to-text Generation with Entity Modeling☆14Feb 9, 2021Updated 5 years ago
- Thin wrapper for the AllenNLP's implementation of supervised open information extraction☆17Nov 19, 2019Updated 6 years ago
- A simple web-based interface for ChatGPT.☆12Jul 1, 2023Updated 2 years ago
- Code for the EMNLP'21 paper "Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding"☆16Mar 13, 2022Updated 3 years ago
- ☆17Mar 15, 2023Updated 2 years ago
- Cascaded Text Generation with Markov Transformers☆130Mar 20, 2023Updated 2 years ago
- Code for "Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models (CoNLL 2018)"☆15Feb 6, 2019Updated 7 years ago
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".☆129Jun 30, 2021Updated 4 years ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆50Jun 30, 2025Updated 8 months ago
- Cleaned up version of the PlotMachines code☆68Jun 12, 2023Updated 2 years ago
- Streamlit, but better.☆16Feb 5, 2024Updated 2 years ago
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆36Jan 20, 2026Updated last month
- Code for "Simulated Multiple Reference Training Improves Low-Resource Machine Translation"☆15Dec 1, 2020Updated 5 years ago