A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
☆1,567Jun 12, 2025Updated 10 months ago
Alternatives and similar repositories for entity-recognition-datasets
Users that are interested in entity-recognition-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)☆345Oct 30, 2022Updated 3 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆392Feb 8, 2022Updated 4 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,363Oct 27, 2025Updated 5 months ago
- Pytorch-Named-Entity-Recognition-with-BERT☆1,249May 6, 2021Updated 4 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,182Aug 28, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,974Jul 28, 2024Updated last year
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆71Jan 27, 2023Updated 3 years ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,897Jun 30, 2022Updated 3 years ago
- Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"☆401Sep 7, 2023Updated 2 years ago
- Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).☆1,275May 19, 2022Updated 3 years ago
- 📖 A curated list of awesome resources dedicated to Relation Extraction, one of the most important tasks in Natural Language Processing (…☆1,228Jan 27, 2022Updated 4 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- Data augmentation for NLP☆4,656Jun 24, 2024Updated last year
- Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.☆1,723Mar 24, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Learning Named Entity Tagger from Domain-Specific Dictionary☆485Oct 5, 2019Updated 6 years ago
- Open source annotation tool for machine learning practitioners.☆10,609Apr 9, 2026Updated last week
- An open-source NLP research library, built on PyTorch.☆11,893Nov 22, 2022Updated 3 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations☆176Jul 25, 2024Updated last year
- Named Entity Recognition as Dependency Parsing☆352Aug 16, 2023Updated 2 years ago
- Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.☆1,485Dec 7, 2022Updated 3 years ago
- Named Entity Recognition data for Europeana Newspapers☆173Apr 5, 2023Updated 3 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t…☆222Jul 2, 2024Updated last year
- State-of-the-Art Text Embeddings☆18,534Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Pytorch implementation of LSTM/BERT-CRF for named entity recognition☆392May 18, 2025Updated 10 months ago
- Named Entity Recognition Tool☆1,174May 27, 2019Updated 6 years ago
- 该repo可用于将OntoNotes-5.0转换为Conll格式☆132Nov 3, 2022Updated 3 years ago
- ☆234Aug 15, 2017Updated 8 years ago
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)☆5,971Feb 15, 2023Updated 3 years ago
- This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguatio…☆281Mar 16, 2024Updated 2 years ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,754Dec 20, 2023Updated 2 years ago
- Entity Linker solution☆1,206Sep 21, 2023Updated 2 years ago
- Framework to learn Named Entity Recognition models without labelled data using weak supervision.☆123Apr 19, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP☆12,835Jan 23, 2024Updated 2 years ago
- LUKE -- Language Understanding with Knowledge-based Embeddings☆727Nov 19, 2023Updated 2 years ago
- PyTorch code for SpERT: Span-based Entity and Relation Transformer☆713Feb 1, 2024Updated 2 years ago
- An Open-Source Package for Neural Relation Extraction (NRE)☆4,454Jan 10, 2024Updated 2 years ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks☆2,891Apr 13, 2023Updated 3 years ago
- Autoregressive Entity Retrieval☆798Jul 6, 2023Updated 2 years ago
- 🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP☆1,195Aug 1, 2023Updated 2 years ago