Layout-Parser / platform
☆10Updated 3 years ago
Alternatives and similar repositories for platform
Users that are interested in platform are comparing it to the libraries listed below
Sorting:
- multimodal document analysis☆164Updated 11 months ago
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆47Updated 2 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆176Updated 2 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 9 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated last month
- An OCR evaluation tool☆65Updated 3 weeks ago
- Streamlit Named Entity Recognition (NER) annotation custom component☆38Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.☆21Updated last year
- ☆13Updated last year
- Libraries, Archives and Museums (LAM)☆83Updated 2 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆18Updated 9 months ago
- ☆80Updated 3 years ago
- OCR & Ground Truth Resources☆75Updated 3 years ago
- ☆67Updated last year
- Logical structure analysis for visually structured documents☆89Updated 2 years ago
- A suite of batches and tools for OCR tasks.☆71Updated 2 years ago
- DFKI Layout Detection for OCR-D☆47Updated 2 weeks ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- Collection of OCR-related python tools and wrappers from @OCR-D☆128Updated 2 weeks ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- link raw affiliation to ROR ids☆30Updated last year
- Seed Machine Translation Data☆31Updated 6 months ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆13Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated 2 years ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆50Updated 7 months ago
- Generate a SQLite database from Wikipedia & Wikidata dumps.☆35Updated last year
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆23Updated 2 years ago
- Scrollership through 20m pubmed abstracts.☆26Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago