tamuhey / tokenizationsLinks
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
☆29Updated 3 years ago
Alternatives and similar repositories for tokenizations
Users that are interested in tokenizations are comparing it to the libraries listed below
Sorting:
- Baseline models for the paper: "Modeling Naive Psychology of Characters in Simple Commonsense Stories" by Hannah Rashkin, Antoine Bosselu…☆16Updated 4 years ago
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)☆48Updated 9 months ago
- ☆31Updated last year
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆103Updated 4 years ago
- ☆19Updated 5 years ago
- Code for ModularQA☆28Updated 3 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…☆47Updated 6 months ago
- [Work in progress] A reading list for machine commonsense reasoning☆35Updated 5 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections☆50Updated 3 years ago
- ☆30Updated 3 years ago
- This repository contains the dataset and the pytorch implementations of the models from the paper CIDER: Commonsense Inference for Dialog…☆27Updated 2 years ago
- ☆49Updated last year
- Code and CoarseWSD-20 datasets for "Language Models and Word Sense Disambiguation: An Overview and Analysis"☆25Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- ☆28Updated last year
- Repository for the Question Answering via Sentence Composition (QASC) dataset☆56Updated last year
- ☆16Updated last year
- A web application for playing 20 Questions to crowdsource common sense. 🤖☆15Updated 2 years ago
- Source code of the paper "Do Syntax Trees Help Pre-trained Transformers Extract Information?" (EACL 2021)☆75Updated 3 years ago
- Code for WikiAsp: Multi-document aspect-based summarization.☆41Updated 4 years ago
- ☆44Updated last year
- a corpus containing 4.5K conversations from the Conversational Question-Answering dataset CoQA, for a total of 53K follow-up question-ans…☆16Updated last year
- Multilingual Compositional Wikidata Questions (MCWQ)☆18Updated last year
- ☆97Updated 2 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence☆34Updated last year
- Contrastive Fact Verification☆71Updated 2 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14Updated 5 years ago
- A benchmark dataset for evaluating dialog system and natural language generation metrics.☆37Updated 2 years ago