tamuhey / tokenizations
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
☆30 · Updated 3 years ago
Alternatives and similar repositories for tokenizations:
Users who are interested in tokenizations are comparing it to the libraries listed below.
- Code for WikiAsp: Multi-document aspect-based summarization. ☆41 · Updated 4 years ago
- Baseline models for the paper: "Modeling Naive Psychology of Characters in Simple Commonsense Stories" by Hannah Rashkin, Antoine Bosselu… ☆16 · Updated 4 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections ☆50 · Updated 3 years ago
- ☆19 · Updated 5 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual… ☆46 · Updated 3 months ago
- [EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures" ☆30 · Updated 3 years ago
- Multilingual Compositional Wikidata Questions (MCWQ) ☆18 · Updated last year
- Source code of the paper "Do Syntax Trees Help Pre-trained Transformers Extract Information?" (EACL 2021) ☆74 · Updated 3 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources ☆33 · Updated 2 years ago
- ☆49 · Updated last year
- A Constrained Text Generation Challenge Towards Generative Commonsense Reasoning ☆140 · Updated last year
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models ☆38 · Updated 2 years ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations" ☆29 · Updated 4 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ☆101 · Updated 4 years ago
- Graph Ensemble Learning ☆38 · Updated last year
- ☆97 · Updated 2 years ago
- ☆77 · Updated 10 months ago
- The Multitask Long Document Benchmark ☆38 · Updated 2 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆55 · Updated 2 years ago
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022) ☆47 · Updated 7 months ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue ☆32 · Updated 2 years ago
- Code and data for the paper: "Unsupervised Common Sense Question Answering with Self-Talk" ☆78 · Updated 3 years ago
- ☆28 · Updated last year
- Contrastive Fact Verification ☆71 · Updated 2 years ago
- Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation" ☆64 · Updated 3 years ago
- Few-shot NLP benchmark for unified, rigorous eval ☆91 · Updated 2 years ago
- ☆15 · Updated 3 years ago
- Repository for the Question Answering via Sentence Composition (QASC) dataset ☆53 · Updated last year
- a large scientific paraphrase dataset for longer paraphrase generation ☆38 · Updated 2 years ago
- NLG and NLU for dialogue processing ☆42 · Updated last year