paperai / pdfanno
Linguistic Annotation and Visualization Tool for PDF Documents
β200Updated 5 years ago
Alternatives and similar repositories for pdfanno:
Users that are interested in pdfanno are comparing it to the libraries listed below
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.β94Updated 3 years ago
- π Work continues on INCEpTION π https://github.com/inception-project/inception π -- β οΈ The official WebAnno repository has reached theβ¦β244Updated 2 years ago
- Anafora is a web-based raw text annotation toolβ241Updated 2 years ago
- a Deep Learning Framework for Text https://delft.readthedocs.io/β392Updated last week
- GROBID extension for identifying and normalizing physical quantities.β77Updated 5 months ago
- High-level build project for all LAPDF-Text submodulesβ103Updated 9 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.gβ¦β111Updated 3 weeks ago
- Neuralized version of the Reference String Parser component of the ParsCit package.β80Updated 2 years ago
- Toolbox for OCR post-correctionβ122Updated 5 years ago
- A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.β122Updated last year
- PDF to XML ALTO file converterβ224Updated this week
- A python library for automatic semantic graph generation from human-readable text.β27Updated 5 years ago
- Software that makes labeling PDFs easy.β405Updated 9 months ago
- LanguageCrunch NLP server docker imageβ287Updated 2 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF β¦β66Updated 4 years ago
- A tool for visualizing trees, tailored specifically to the analysis of parse trees.β81Updated 4 years ago
- A collection of simple tutorials for using Fonduerβ99Updated 4 years ago
- NanigoNet β Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networksβ72Updated last year
- INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.β609Updated this week
- Hunspell extension for spaCy 2.0.β94Updated 6 months ago
- β40Updated 6 years ago
- πGUI for training spaCy modelsβ54Updated 3 years ago
- TETRE: a Toolkit for Exploring Text for Relation Extractionβ75Updated 7 years ago
- Table Extraction Toolβ90Updated 6 years ago
- The official tool for transforming doccano format into common dataset formats.β106Updated last year
- CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysisβ108Updated 2 months ago
- AmbiverseNLU: A Natural Language Understanding suite by Max Planck Institute for Informaticsβ209Updated last year
- Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.β130Updated 6 years ago
- πTagEditor - Annotation tool for spaCyβ190Updated 2 years ago
- CoNLL-U format library for JavaScriptβ72Updated 7 years ago