A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
☆1,564Jun 12, 2025Updated 9 months ago
Alternatives and similar repositories for entity-recognition-datasets
Users that are interested in entity-recognition-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)☆345Oct 30, 2022Updated 3 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆392Feb 8, 2022Updated 4 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,352Oct 27, 2025Updated 4 months ago
- Pytorch-Named-Entity-Recognition-with-BERT☆1,249May 6, 2021Updated 4 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,177Aug 28, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,974Jul 28, 2024Updated last year
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆70Jan 27, 2023Updated 3 years ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,896Jun 30, 2022Updated 3 years ago
- Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"☆398Sep 7, 2023Updated 2 years ago
- Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).☆1,273May 19, 2022Updated 3 years ago
- 📖 A curated list of awesome resources dedicated to Relation Extraction, one of the most important tasks in Natural Language Processing (…☆1,229Jan 27, 2022Updated 4 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- Data augmentation for NLP☆4,652Jun 24, 2024Updated last year
- Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.☆1,720Mar 24, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Learning Named Entity Tagger from Domain-Specific Dictionary☆485Oct 5, 2019Updated 6 years ago
- Open source annotation tool for machine learning practitioners.☆10,583Updated this week
- An open-source NLP research library, built on PyTorch.☆11,893Nov 22, 2022Updated 3 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations☆176Jul 25, 2024Updated last year
- Named Entity Recognition as Dependency Parsing☆351Aug 16, 2023Updated 2 years ago
- Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.☆1,484Dec 7, 2022Updated 3 years ago
- Named Entity Recognition data for Europeana Newspapers☆173Apr 5, 2023Updated 2 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t…☆222Jul 2, 2024Updated last year
- State-of-the-Art Text Embeddings☆18,427Mar 12, 2026Updated last week
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Pytorch implementation of LSTM/BERT-CRF for named entity recognition☆390May 18, 2025Updated 10 months ago
- Named Entity Recognition Tool☆1,174May 27, 2019Updated 6 years ago
- 该repo可用于将OntoNotes-5.0转换为Conll格式☆132Nov 3, 2022Updated 3 years ago
- ☆235Aug 15, 2017Updated 8 years ago
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)☆5,970Feb 15, 2023Updated 3 years ago
- This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguatio…☆281Mar 16, 2024Updated 2 years ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,750Dec 20, 2023Updated 2 years ago
- Entity Linker solution☆1,206Sep 21, 2023Updated 2 years ago
- Framework to learn Named Entity Recognition models without labelled data using weak supervision.☆123Apr 19, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP☆12,829Jan 23, 2024Updated 2 years ago
- LUKE -- Language Understanding with Knowledge-based Embeddings☆727Nov 19, 2023Updated 2 years ago
- PyTorch code for SpERT: Span-based Entity and Relation Transformer☆712Feb 1, 2024Updated 2 years ago
- An Open-Source Package for Neural Relation Extraction (NRE)☆4,451Jan 10, 2024Updated 2 years ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks☆2,890Apr 13, 2023Updated 2 years ago
- Autoregressive Entity Retrieval☆796Jul 6, 2023Updated 2 years ago
- 🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP☆1,195Aug 1, 2023Updated 2 years ago