LibraryOfCongress / newspaper-navigator
☆235Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for newspaper-navigator
- Distant Viewing Toolkit for the Analysis of Visual Culture☆91Updated 2 months ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆203Updated last year
- Using ML to extract campaign finance data from messy forms for journalism☆76Updated 2 years ago
- Tutorials for working with Library of Congress collections data☆192Updated this week
- Document Layout Analysis☆350Updated this week
- A Large Dataset of Historical Japanese Documents with Complex Layouts☆32Updated 2 years ago
- A suite of batches and tools for OCR tasks.☆71Updated last year
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆52Updated last year
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆124Updated 3 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆180Updated last month
- Generic framework for historical document processing☆373Updated 3 years ago
- Collection of OCR-related python tools and wrappers from @OCR-D☆120Updated this week
- Detect and align similar passages☆89Updated 2 months ago
- A collection of notebooks for Natural Language Processing☆24Updated this week
- Code for the CUP Elements on text analysis in Python for social scientists☆135Updated 2 years ago
- dhSegment on pytorch☆32Updated last year
- Toolbox for OCR post-correction☆123Updated 5 years ago
- A collection of Jupyter notebooks in many human and computer languages for doing digital humanities. PRs welcome!☆125Updated last year
- Scripts that clean up OCR and munge Hathi metadata.☆74Updated 7 years ago
- Python 3 library for processing historical English☆64Updated 3 months ago
- Software that makes labeling PDFs easy.☆391Updated 6 months ago
- A Python wrapper around the topic modeling functions of MALLET.☆99Updated 3 weeks ago
- Iconographic Visualization Inside Computational Notebooks☆35Updated 3 months ago
- Introduction to Cultural Analytics & Python, course website and online textbook powered by Jupyter Book☆259Updated 8 months ago
- Page to PAGE Layout Analysis Tool☆191Updated 2 years ago
- Highlighting various OCR formats directly in Solr☆84Updated this week
- Detectron2 for Document Layout Analysis☆185Updated 3 months ago
- Fork of dhSegment for experiments on visual and textual feature combination.☆15Updated 3 years ago
- Text and statistics utilities from Pew Research Center☆82Updated 2 years ago
- ☆42Updated 3 months ago