explosion / spacy-alignments
π« A spaCy package for Yohei Tamura's Rust tokenizations library
β29Updated last year
Alternatives and similar repositories for spacy-alignments:
Users that are interested in spacy-alignments are comparing it to the libraries listed below
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.β44Updated 11 months ago
- allennlp-light is a port of AllenNLP's core modules and nn portions into a standalone package with minimum dependenciesβ56Updated 2 years ago
- Bayesian Assessment of Hypothesesβ24Updated last year
- β17Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchiβ¦β33Updated 11 months ago
- MultiCite code and data. Models are available on Huggingface.β31Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)β24Updated 4 months ago
- β33Updated 2 weeks ago
- Data Programming by Demonstration (DPBD) for Document Classificationβ35Updated 3 years ago
- Tool for parsing and converting various span encoding schemes.β23Updated last year
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selectionβ15Updated 3 years ago
- β27Updated last month
- Converter from UD-trees to BART representationβ36Updated last year
- β46Updated 3 years ago
- β16Updated 4 months ago
- Train transformer-based models.β28Updated 3 weeks ago
- Generate BERT vocabularies and pretraining examples from Wikipediasβ18Updated 4 years ago
- β45Updated 3 years ago
- β10Updated 4 years ago
- β‘οΈ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easyβ33Updated 3 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.β68Updated 3 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.β45Updated last year
- The NLPStatTest projectβ12Updated 3 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Exampleβ21Updated 3 years ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instβ¦β22Updated 3 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrievalβ53Updated last week
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/β86Updated last week
- β75Updated 3 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating themβ37Updated 2 years ago
- Statistics on multilingual datasetsβ17Updated 2 years ago