DocTAG / doctag-core
This is the main repository for the DocTAG annotation tool. DocTAG is a portable, customizable annotation tool specifically designed for the Information Retrieval (IR) domain.
☆19Updated 2 years ago
Alternatives and similar repositories for doctag-core:
Users who are interested in doctag-core are comparing it to the libraries listed below.
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆105Updated 11 months ago
- The Plumber framework for KG completion and structured triples extraction☆21Updated last year
- Bi-encoder Based Entity Linking Tutorial. You can run the experiment in only 5 minutes. Experiments on a Colab Pro GPU are also supported!☆34Updated 3 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆66Updated last year
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- SciRepEval benchmark training and evaluation scripts☆73Updated 10 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆121Updated 11 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆107Updated 10 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- spaCy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)☆54Updated 2 years ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia☆30Updated 2 years ago
- cRocoDiLe is a dataset extraction tool for Relation Extraction using Wikipedia and Wikidata presented in REBEL (EMNLP 2021).☆65Updated last year
- XAI based human-in-the-loop framework for automatic rule-learning.☆48Updated 8 months ago
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated last year
- A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.☆56Updated 8 months ago
- Biomedical Data-to-Text Generation via Fine-Tuning Transformers☆28Updated 3 years ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆22Updated 2 years ago
- A Large Semantic Knowledge Graph from Wikipedia Categories and Listings☆26Updated last year
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Updated last year
- A comprehensive benchmark for entity disambiguation☆25Updated last year
- Tool for disambiguating acronyms and abbreviations in text for NLP applications☆22Updated 9 months ago
- 💫 spaCy wrapper for ConceptNet 💫☆90Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago