A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
☆1,561 · Jun 12, 2025 · Updated 8 months ago
Alternatives and similar repositories for entity-recognition-datasets
Users who are interested in entity-recognition-datasets are comparing it to the repositories listed below.
- Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English) ☆345 · Oct 30, 2022 · Updated 3 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER) ☆392 · Feb 8, 2022 · Updated 4 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ☆14,354 · Oct 27, 2025 · Updated 4 months ago
- A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.) ☆1,175 · Aug 28, 2024 · Updated last year
- Pytorch-Named-Entity-Recognition-with-BERT ☆1,248 · May 6, 2021 · Updated 4 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo… ☆22,981 · Jul 28, 2024 · Updated last year
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character … ☆1,897 · Jun 30, 2022 · Updated 3 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer… ☆396 · May 11, 2023 · Updated 2 years ago
- 📖 A curated list of awesome resources dedicated to Relation Extraction, one of the most important tasks in Natural Language Processing (… ☆1,227 · Jan 27, 2022 · Updated 4 years ago
- Data augmentation for NLP ☆4,645 · Jun 24, 2024 · Updated last year
- Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset). ☆1,273 · May 19, 2022 · Updated 3 years ago
- Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results. ☆1,721 · Mar 24, 2023 · Updated 2 years ago
- Learning Named Entity Tagger from Domain-Specific Dictionary ☆485 · Oct 5, 2019 · Updated 6 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2… ☆70 · Jan 27, 2023 · Updated 3 years ago
- An open-source NLP research library, built on PyTorch. ☆11,889 · Nov 22, 2022 · Updated 3 years ago
- Open source annotation tool for machine learning practitioners. ☆10,555 · Feb 17, 2026 · Updated 2 weeks ago
- Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset" ☆398 · Sep 7, 2023 · Updated 2 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations ☆176 · Jul 25, 2024 · Updated last year
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering. ☆1,752 · Dec 20, 2023 · Updated 2 years ago
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP) ☆5,971 · Feb 15, 2023 · Updated 3 years ago
- Named Entity Recognition as Dependency Parsing ☆351 · Aug 16, 2023 · Updated 2 years ago
- State-of-the-Art Text Embeddings ☆18,323 · Updated this week
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t… ☆222 · Jul 2, 2024 · Updated last year
- Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on. ☆1,484 · Dec 7, 2022 · Updated 3 years ago
- LUKE -- Language Understanding with Knowledge-based Embeddings ☆727 · Nov 19, 2023 · Updated 2 years ago
- Entity Linker solution ☆1,206 · Sep 21, 2023 · Updated 2 years ago
- Pytorch implementation of LSTM/BERT-CRF for named entity recognition ☆390 · May 18, 2025 · Updated 9 months ago
- Named Entity Recognition Tool ☆1,173 · May 27, 2019 · Updated 6 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 · Sep 2, 2024 · Updated last year
- ✨Fast Coreference Resolution in spaCy with Neural Networks ☆2,892 · Apr 13, 2023 · Updated 2 years ago
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP ☆12,817 · Jan 23, 2024 · Updated 2 years ago
- Autoregressive Entity Retrieval ☆797 · Jul 6, 2023 · Updated 2 years ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve… ☆4,231 · Aug 25, 2025 · Updated 6 months ago
- 🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP ☆1,194 · Aug 1, 2023 · Updated 2 years ago
- Framework to learn Named Entity Recognition models without labelled data using weak supervision. ☆124 · Apr 19, 2021 · Updated 4 years ago
- PyTorch code for SpERT: Span-based Entity and Relation Transformer ☆712 · Feb 1, 2024 · Updated 2 years ago
- Language-Agnostic SEntence Representations ☆3,659 · May 2, 2024 · Updated last year
- brat rapid annotation tool (brat) - for all your textual annotation needs ☆1,875 · Jul 3, 2024 · Updated last year
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList ☆2,050 · Jan 9, 2024 · Updated 2 years ago