dhfbk / KIND
KIND: an Italian Multi-Domain Dataset for Named Entity Recognition
☆15Updated last year
Alternatives and similar repositories for KIND:
Users that are interested in KIND are comparing it to the libraries listed below
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆22Updated 2 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆12Updated last year
- The NLPStatTest project☆12Updated 3 years ago
- Automatically detect errors in annotated corpora.☆47Updated last year
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆83Updated this week
- ☆26Updated 2 weeks ago
- ☆17Updated 2 years ago
- ☆22Updated 2 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 2 years ago
- This repository contains code and data for the EMNLP 2022 paper "CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about…☆10Updated 2 years ago
- Semantically Structured Sentence Embeddings☆65Updated 4 months ago
- ☆38Updated 2 months ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆16Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Updated last year
- Tool for parsing and converting various span encoding schemes.☆22Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆18Updated last month
- This repository hosts the code for a tokenizer of tweets.☆12Updated 6 years ago
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆15Updated 8 months ago
- A repository to keep tools, scripts, data for SMART task.☆11Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 10 months ago
- Learned string similarity for entity names using optimal transport.☆35Updated 4 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆26Updated 5 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- ☆13Updated 3 years ago