The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.
☆25Jul 2, 2024Updated last year
Alternatives and similar repositories for CleanCoNLL
Users that are interested in CleanCoNLL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Jul 23, 2025Updated 11 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆134Jul 26, 2025Updated 11 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110May 16, 2024Updated 2 years ago
- Evaluate language models using multiple choice items☆13Mar 6, 2026Updated 3 months ago
- A comprehensive benchmark for entity disambiguation☆29Jun 29, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16May 30, 2019Updated 7 years ago
- A dataset for realistic evaluation of noisy label methods☆15Dec 3, 2023Updated 2 years ago
- ☆14Jun 18, 2026Updated last week
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆77Sep 24, 2025Updated 9 months ago
- ☆25Oct 5, 2020Updated 5 years ago
- A general Coverletter supported by typst☆16Dec 11, 2025Updated 6 months ago
- This repository contains my teaching material. Most of it is in German.☆13Jun 12, 2026Updated 2 weeks ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆219Mar 12, 2026Updated 3 months ago
- Collection of scripts that enhance Zotero☆18Mar 22, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Command-line tool and Rust library for handling Web ARChive (WARC) files☆32Jun 2, 2025Updated last year
- ECGDL: A framework for comparative study of databases and computational methods for arrhythmia detection from single-lead ECG☆20Aug 31, 2023Updated 2 years ago
- Few-Shot Relation Extraction with AllenNLP☆12Jan 27, 2019Updated 7 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 6 months ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- The implementation for "Open Relation Modeling: Learning to Define Relations between Entities" (Findings of ACL '22)☆12Feb 28, 2022Updated 4 years ago
- ☆13Jul 13, 2021Updated 4 years ago
- ☆14Jan 25, 2026Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Jun 23, 2022Updated 4 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- Browser based post correction tool for Alto XML files☆14Sep 20, 2013Updated 12 years ago
- ☆31Dec 13, 2023Updated 2 years ago
- Python implementation of the random-walk inductive classification algorithm Modified Adsorption from P. Talukdar☆15Jul 30, 2014Updated 11 years ago
- ☆40Aug 11, 2023Updated 2 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- ☆19Jan 18, 2026Updated 5 months ago
- Search the biomedical literature for protein interactions and protein associations☆11Nov 24, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- European Parliament Open Data - Call for beta testers☆35Nov 11, 2024Updated last year
- ☆14Nov 22, 2013Updated 12 years ago
- Per-collection OCR leaderboards using VLM-as-judge☆60Jun 2, 2026Updated 3 weeks ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11May 26, 2026Updated last month
- Repository for My HuggingFace Natural Language Processing Projects☆31Aug 31, 2023Updated 2 years ago
- MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)☆18Oct 13, 2022Updated 3 years ago
- A multilingual DeBERTa model fine-tuned on political communication to classify discrete emotions☆18Nov 10, 2023Updated 2 years ago