jinlanfu / Polyglot_Prompt
Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.
☆18Updated 2 years ago
Alternatives and similar repositories for Polyglot_Prompt
Users who are interested in Polyglot_Prompt are comparing it to the libraries listed below.
- Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)☆74Updated 3 years ago
- ☆45Updated 4 years ago
- ☆20Updated last year
- ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities☆35Updated 2 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆81Updated last year
- ☆11Updated 3 years ago
- Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20…☆31Updated 4 years ago
- ☆21Updated 2 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆22Updated last year
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021)☆13Updated 4 years ago
- ☆41Updated 4 years ago
- Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-…☆35Updated 10 months ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"☆21Updated 5 years ago
- Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)☆30Updated 3 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆37Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Updated 2 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 4 years ago
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…☆14Updated 2 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 3 years ago
- ☆22Updated 3 years ago
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…☆19Updated 2 years ago
- Instruction to data diversification☆24Updated 5 years ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆69Updated 3 years ago
- ☆25Updated 3 years ago
- ☆38Updated 4 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Updated 8 months ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 5 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 3 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.☆57Updated 3 years ago