tamuhey / tokenizations
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
☆29 · Updated 3 years ago
Alternatives and similar repositories for tokenizations:
Users who are interested in tokenizations are comparing it to the libraries listed below.
- ☆19 · Updated 5 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆55 · Updated 2 years ago
- ☆31 · Updated 3 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual… ☆46 · Updated 4 months ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ☆102 · Updated 4 years ago
- ☆97 · Updated 2 years ago
- ☆49 · Updated last year
- ☆15 · Updated 3 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources ☆33 · Updated 2 years ago
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022) ☆48 · Updated 8 months ago
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators ☆24 · Updated last year
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings ☆58 · Updated 2 years ago
- ☆46 · Updated 5 years ago
- Graph Ensemble Learning ☆38 · Updated last year
- Repository for the Question Answering via Sentence Composition (QASC) dataset ☆54 · Updated last year
- ☆38 · Updated last year
- [Work in progress] A reading list for machine commonsense reasoning ☆35 · Updated 5 years ago
- Language model Prompt And Query Archive ☆158 · Updated 3 years ago
- Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://a… ☆46 · Updated 2 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue ☆32 · Updated 2 years ago
- This repository contains the dataset and the pytorch implementations of the models from the paper CIDER: Commonsense Inference for Dialog… ☆27 · Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation" ☆30 · Updated 3 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021 ☆29 · Updated 2 years ago
- Generalizing Natural Language Analysis through Span-relation Representations ☆91 · Updated 2 years ago
- Code for the paper "Learning an Unreferenced Metric for Online Dialogue Evaluation", ACL 2020 ☆28 · Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations" ☆31 · Updated last year
- ☆33 · Updated last year
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021). ☆28 · Updated 3 years ago
- ☆46 · Updated last year
- ☆77 · Updated 11 months ago