The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.
☆25Jul 2, 2024Updated last year
Alternatives and similar repositories for CleanCoNLL
Users that are interested in CleanCoNLL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Jul 23, 2025Updated 9 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆135Jul 26, 2025Updated 9 months ago
- A comprehensive benchmark for entity disambiguation☆29Jun 29, 2023Updated 2 years ago
- A dataset for realistic evaluation of noisy label methods☆15Dec 3, 2023Updated 2 years ago
- Extension for pie to include taggers with their models and pre/postprocessors☆11May 30, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆13Mar 31, 2026Updated last month
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆77Sep 24, 2025Updated 7 months ago
- Language Models for Zalando's flair library☆61Jan 20, 2020Updated 6 years ago
- This repository contains my teaching material. Most of it is in German.☆13May 9, 2023Updated 2 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆211Mar 12, 2026Updated last month
- Bias correction for richness in abundance data☆12Apr 20, 2026Updated last week
- CAMeL Dataset☆15Apr 15, 2025Updated last year
- Command-line tool and Rust library for handling Web ARChive (WARC) files☆30Jun 2, 2025Updated 10 months ago
- Few-Shot Relation Extraction with AllenNLP☆12Jan 27, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A unit test framework for prompts.☆11Feb 9, 2023Updated 3 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 4 months ago
- ☆10Mar 16, 2023Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- The implementation for "Open Relation Modeling: Learning to Define Relations between Entities" (Findings of ACL '22)☆12Feb 28, 2022Updated 4 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- An opinionated NLP research template☆10Aug 29, 2024Updated last year
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Browser based post correction tool for Alto XML files☆14Sep 20, 2013Updated 12 years ago
- Unofficial Python client for Azure cognitive search☆11Jun 7, 2019Updated 6 years ago
- ☆31Dec 13, 2023Updated 2 years ago
- A tool for analyzing and visualizing discrete temporal events☆17Aug 15, 2018Updated 7 years ago
- Python implementation of the random-walk inductive classification algorithm Modified Adsorption from P. Talukdar☆15Jul 30, 2014Updated 11 years ago
- A pure-Python Beaker client☆18Oct 14, 2025Updated 6 months ago
- ☆40Aug 11, 2023Updated 2 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- Search the biomedical literature for protein interactions and protein associations☆11Nov 24, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- European Parliament Open Data - Call for beta testers☆35Nov 11, 2024Updated last year
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Mar 10, 2019Updated 7 years ago
- MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)☆18Oct 13, 2022Updated 3 years ago
- A multilingual DeBERTa model fine-tuned on political communication to classify discrete emotions☆16Nov 10, 2023Updated 2 years ago
- A package for generating synthetic data and fine-tuning a gliner model.☆15Jun 5, 2024Updated last year
- Website for Applied Language Technology courses at the University of Helsinki☆19Aug 12, 2022Updated 3 years ago
- LowFER: Low-rank Bilinear Pooling for Link Prediction (ICML 2020)☆13Sep 24, 2022Updated 3 years ago