tamuhey / textspanLinks
Text span utilities for Rust and Python
☆22Updated 2 years ago
Alternatives and similar repositories for textspan
Users that are interested in textspan are comparing it to the libraries listed below
Sorting:
- ☘️ Code for Convex Aggregation for Opinion Summarization (Iso et al; Findings of EMNLP 2021)☆35Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆79Updated 3 years ago
- ⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy☆32Updated 4 years ago
- A multilingual version of MS MARCO passage ranking dataset☆145Updated 2 years ago
- Repro is a library for easily running code from published papers via Docker.☆41Updated 2 years ago
- ☆10Updated 3 years ago
- ☆31Updated 2 years ago
- A simple implementation of SimCSE☆77Updated 3 years ago
- JaQuAD: Japanese Question Answering Dataset for Machine Reading Comprehension (2022, Skelter Labs)☆108Updated 3 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Updated last year
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 3 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆54Updated 2 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆193Updated 2 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆57Updated 3 years ago
- A template for starting a new allennlp project using config files and `allennlp train`☆38Updated last year
- A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …☆14Updated 4 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 3 years ago
- The NLPStatTest project☆12Updated 3 years ago
- NIILC QA data☆18Updated 10 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆96Updated 9 months ago
- X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents (JCDL 2022)☆14Updated 3 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆148Updated 3 years ago
- Pretraining scripts for BART transformer model☆12Updated 2 years ago
- SciGen☆24Updated 4 years ago
- ☆60Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 3 months ago
- MFAQ: a Multilingual FAQ Dataset☆18Updated 2 years ago
- ☆20Updated 4 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆100Updated 2 years ago
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆31Updated 4 years ago