flairNLP / CleanCoNLL
The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.
☆23Updated 8 months ago
Alternatives and similar repositories for CleanCoNLL:
Users that are interested in CleanCoNLL are comparing it to the libraries listed below
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆90Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 10 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- Automatically detect errors in annotated corpora.☆47Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆44Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆105Updated 10 months ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆83Updated this week
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!☆34Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- ☆38Updated 2 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Robust and fast topic models with sentence-transformers.☆44Updated 3 weeks ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆22Updated 2 months ago
- Tool for parsing and converting various span encoding schemes.☆22Updated last year
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph☆21Updated last year
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆121Updated 10 months ago
- MultiCite code and data. Models are available on Huggingface.☆31Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆24Updated 3 months ago
- ☆45Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 10 months ago
- Semantically Structured Sentence Embeddings☆65Updated 4 months ago
- A High-level Library for Named Entity Recognition in Python.☆23Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated last year
- ☆28Updated last year
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆51Updated last year