DocTAG / doctag-core
This is the main repository for the DocTAG annotation tool. DocTAG is a portable, customizable annotation tool specifically designed for the Information Retrieval (IR) domain.
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for doctag-core
- ☆15Updated last month
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆53Updated 3 months ago
- ☆83Updated 2 months ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆31Updated this week
- A Python library aimed at dissecting and augmenting NER training data.☆57Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆42Updated last year
- A Python Interface to Reproducibility Measures of System-Oriented IR Experiments☆11Updated 2 years ago
- An opensource TAR framework for experiments and applications☆16Updated 8 months ago
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser…☆40Updated last month
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!☆33Updated 3 years ago
- ↕️ Intuitive axiomatic retrieval experimentation.☆23Updated last week
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆27Updated 2 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆103Updated 7 months ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 6 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 2 years ago
- SciRepEval benchmark training and evaluation scripts☆67Updated 6 months ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆20Updated 10 months ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆50Updated last year
- MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer☆33Updated 2 years ago
- A High-level Library for Named Entity Recognition in Python.☆22Updated 11 months ago
- A Workbench for Autograding Retrieve/Generate Systems☆13Updated 3 weeks ago
- ☆82Updated 6 months ago
- ☆10Updated 2 years ago
- ☆45Updated 2 years ago
- Automatically detect errors in annotated corpora.☆47Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆43Updated 6 months ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆64Updated 2 years ago
- A Benchmark Workflow and Dataset Collection for Query Refinement☆25Updated last year
- ☆39Updated last year
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago