The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.
☆25Jul 2, 2024Updated last year
Alternatives and similar repositories for CleanCoNLL
Users that are interested in CleanCoNLL are comparing it to the libraries listed below
Sorting:
- ☆17Jul 23, 2025Updated 7 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆135Jul 26, 2025Updated 7 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110May 16, 2024Updated last year
- Evaluate language models using multiple choice items☆13Mar 6, 2026Updated 2 weeks ago
- A comprehensive benchmark for entity disambiguation☆29Jun 29, 2023Updated 2 years ago
- A dataset for realistic evaluation of noisy label methods☆14Dec 3, 2023Updated 2 years ago
- A general Coverletter supported by typst☆14Dec 11, 2025Updated 3 months ago
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆77Sep 24, 2025Updated 5 months ago
- Language Models for Zalando's flair library☆61Jan 20, 2020Updated 6 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆206Mar 12, 2026Updated last week
- This repository contains my teaching material. Most of it is in German.☆12May 9, 2023Updated 2 years ago
- CAMeL Dataset☆15Apr 15, 2025Updated 11 months ago
- Command-line tool and Rust library for handling Web ARChive (WARC) files☆28Jun 2, 2025Updated 9 months ago
- ☆14Jan 10, 2021Updated 5 years ago
- Few-Shot Relation Extraction with AllenNLP☆12Jan 27, 2019Updated 7 years ago
- A unit test framework for prompts.☆11Feb 9, 2023Updated 3 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 3 months ago
- Tools for TICCL☆14Dec 12, 2025Updated 3 months ago
- Tutorials for the julia language☆12Feb 4, 2023Updated 3 years ago
- The implementation for "Open Relation Modeling: Learning to Define Relations between Entities" (Findings of ACL '22)☆12Feb 28, 2022Updated 4 years ago
- A collection of annotated biomedical corpora, which can be used for training supervised machine learning methods for various tasks in bio…☆36Sep 18, 2018Updated 7 years ago
- Abstraction and Reasoning Corpus☆14Nov 22, 2022Updated 3 years ago
- ☆10Mar 16, 2023Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- An opinionated NLP research template☆10Aug 29, 2024Updated last year
- In browser active learning and guided search☆17May 6, 2023Updated 2 years ago
- Browser based post correction tool for Alto XML files☆14Sep 20, 2013Updated 12 years ago
- Unofficial Python client for Azure cognitive search☆11Jun 7, 2019Updated 6 years ago
- ☆31Dec 13, 2023Updated 2 years ago
- A tool for analyzing and visualizing discrete temporal events☆17Aug 15, 2018Updated 7 years ago
- ☆40Aug 11, 2023Updated 2 years ago
- Search the biomedical literature for protein interactions and protein associations☆11Nov 24, 2023Updated 2 years ago
- Per-collection OCR leaderboards using VLM-as-judge☆52Mar 5, 2026Updated 2 weeks ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- ☆18Jan 18, 2026Updated 2 months ago
- European Parliament Open Data - Call for beta testers☆35Nov 11, 2024Updated last year