tamuhey / tokenizations
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
☆29 Updated 4 years ago
Alternatives and similar repositories for tokenizations
Users who are interested in tokenizations are comparing it to the libraries listed below.
- Repository for the Question Answering via Sentence Composition (QASC) dataset ☆56 Updated 2 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ☆104 Updated 4 years ago
- Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles ☆48 Updated last year
- ☆97 Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆57 Updated 3 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆174 Updated 4 years ago
- Code and data for the paper: "Unsupervised Common Sense Question Answering with Self-Talk" ☆79 Updated 4 years ago
- This repository maintains the QAConv dataset, a question-answering dataset on informative conversations including business emails, panel … ☆84 Updated last year
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization" ☆68 Updated 4 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper ☆147 Updated 8 months ago
- FactSumm: Factual Consistency Scorer for Abstractive Summarization ☆113 Updated 2 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆31 Updated 5 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions" ☆120 Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 Updated 3 years ago
- Code for WikiAsp: Multi-document aspect-based summarization. ☆43 Updated 5 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022) ☆80 Updated 2 years ago
- Data and Code for Paper "Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality" (EMNLP 2022) ☆11 Updated 3 years ago
- Neural models of common sense. 🤖 ☆98 Updated 2 years ago
- ☆47 Updated 2 years ago
- Few-shot NLP benchmark for unified, rigorous eval ☆93 Updated 3 years ago
- ☆39 Updated 3 years ago
- ☆68 Updated 8 months ago
- ☆50 Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions and What You Can Do With Them" ☆209 Updated 4 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections ☆52 Updated 4 years ago
- PyTorch original implementation of "Unsupervised Question Decomposition for Question Answering" ☆122 Updated 2 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual… ☆47 Updated last year
- Language model Prompt And Query Archive ☆160 Updated 4 years ago
- A BART version of an open-domain QA model in a closed-book setup ☆119 Updated 5 years ago
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings ☆58 Updated 3 years ago