paperai / pdfanno
Linguistic Annotation and Visualization Tool for PDF Documents
☆200Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for pdfanno
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆65Updated 4 years ago
- Toolbox for OCR post-correction☆123Updated 5 years ago
- 🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the…☆245Updated last year
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆91Updated 2 years ago
- PDF to XML ALTO file converter☆216Updated 2 months ago
- Neuralized version of the Reference String Parser component of the ParsCit package.☆78Updated 2 years ago
- a Deep Learning Framework for Text https://delft.readthedocs.io/☆389Updated 2 weeks ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆110Updated 4 months ago
- Anafora is a web-based raw text annotation tool☆241Updated 2 years ago
- High-level build project for all LAPDF-Text submodules☆103Updated 9 years ago
- Table Extraction Tool☆90Updated 6 years ago
- GROBID extension for identifying and normalizing physical quantities.☆75Updated 2 months ago
- Framework for information extraction from tables☆42Updated 5 years ago
- Software that makes labeling PDFs easy.☆391Updated 6 months ago
- PAGE XML format collection for document image page content and more☆66Updated 3 years ago
- Python library for Natural Language Preprocessing (NLPre)☆190Updated last year
- Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.☆129Updated 6 years ago
- 🏖TagEditor - Annotation tool for spaCy☆187Updated 2 years ago
- Page to PAGE Layout Analysis Tool☆191Updated 2 years ago
- 🚀GUI for training spaCy models☆53Updated 3 years ago
- A Named-Entity Recogniser based on Grobid.☆49Updated 2 months ago
- PDF.js + Hypothesis viewer / annotator☆375Updated 2 years ago
- DFKI Layout Detection for OCR-D☆47Updated 2 weeks ago
- A simple viewer and inspection tool for text boxes in PDF documents☆92Updated 2 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆180Updated last week
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago
- An open-source CRF Reference String Parsing Package☆155Updated 4 years ago
- A tool for visualizing trees, tailored specifically to the analysis of parse trees.☆81Updated 4 years ago
- Python binding to libpoppler with focus on text extraction☆98Updated 2 years ago
- CoNLL-U format library for JavaScript☆72Updated 7 years ago