The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.
☆25Jul 2, 2024Updated last year
Alternatives and similar repositories for CleanCoNLL
Users that are interested in CleanCoNLL are comparing it to the libraries listed below
Sorting:
- ☆17Jul 23, 2025Updated 7 months ago
- A dataset for realistic evaluation of noisy label methods☆14Dec 3, 2023Updated 2 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆206Updated this week
- A comprehensive benchmark for entity disambiguation☆28Jun 29, 2023Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111May 16, 2024Updated last year
- ☆31Dec 13, 2023Updated 2 years ago
- Efficiently find the best-suited language model (LM) for your NLP task☆135Jul 26, 2025Updated 7 months ago
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆77Sep 24, 2025Updated 5 months ago
- Repository for My HuggingFace Natural Language Processing Projects☆31Aug 31, 2023Updated 2 years ago
- European Parliament Open Data - Call for beta testers☆35Nov 11, 2024Updated last year
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- ☆13Feb 17, 2025Updated last year
- ☆40Aug 11, 2023Updated 2 years ago
- [KDD'23] This is the code repo for our KDD'23 paper "DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling".☆11Jun 14, 2023Updated 2 years ago
- An all-in-one R package for the assessment of linguistic similarity☆11Oct 6, 2025Updated 4 months ago
- ☆14Oct 6, 2023Updated 2 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- A repository aimed at sharing links to climate-related resources.☆12Feb 18, 2026Updated last week
- ☆17Updated this week
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 2 months ago
- ☆10Oct 2, 2024Updated last year
- A tutorial on Bayesian multilevel modeling using R and Stan.☆14Nov 19, 2021Updated 4 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- A unit test framework for prompts.☆11Feb 9, 2023Updated 3 years ago
- Fractionation estimation in R package☆10Apr 12, 2020Updated 5 years ago
- ☆11Apr 17, 2023Updated 2 years ago
- Data used in Climate Indicator Project figures and tables☆15Jun 26, 2025Updated 8 months ago
- LCA as Code - Domain-Specific Language for Life-Cycle Analysis☆15Oct 1, 2025Updated 5 months ago
- Extension for pie to include taggers with their models and pre/postprocessors☆11May 30, 2024Updated last year
- Reverse engineering of the FICO algorithm☆13Jan 1, 2023Updated 3 years ago
- Browser based post correction tool for Alto XML files☆14Sep 20, 2013Updated 12 years ago
- Resources used by all of the autometrics implementations☆14Dec 5, 2023Updated 2 years ago
- A toolkit for exhaustively modeling the environmental impact of digital services.☆15Feb 20, 2026Updated last week
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆20Jul 31, 2025Updated 7 months ago
- This repository contains my teaching material. Most of it is in German.☆12May 9, 2023Updated 2 years ago
- Code for "Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification", arXiv 2024☆14Jun 24, 2024Updated last year
- ☆14Oct 19, 2025Updated 4 months ago
- ☆13Jul 13, 2021Updated 4 years ago
- ☆14Nov 12, 2025Updated 3 months ago