LibraryOfCongress / newspaper-navigator
☆238Updated 2 years ago
Alternatives and similar repositories for newspaper-navigator:
Users that are interested in newspaper-navigator are comparing it to the libraries listed below
- Collection of OCR-related python tools and wrappers from @OCR-D☆121Updated last week
- A Large Dataset of Historical Japanese Documents with Complex Layouts☆32Updated 2 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆124Updated 3 years ago
- Digital Humanities Across Borders☆47Updated 9 months ago
- Practical Approaches to Data Science with Text☆39Updated 5 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆52Updated last year
- Detect and align similar passages☆92Updated last month
- A suite of batches and tools for OCR tasks.☆71Updated last year
- Using ML to extract campaign finance data from messy forms for journalism☆76Updated 2 years ago
- Toolbox for OCR post-correction☆122Updated 5 years ago
- Master repository which includes most other OCR-D repositories as submodules☆72Updated 3 months ago
- Document Layout Analysis☆359Updated 3 weeks ago
- Distant Viewing Toolkit for the Analysis of Visual Culture☆92Updated 4 months ago
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- A collection of notebooks for Natural Language Processing☆25Updated this week
- dhSegment on pytorch☆33Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆183Updated 3 months ago
- Detectron2 for Document Layout Analysis☆185Updated 5 months ago
- An OCR evaluation tool☆64Updated last month
- Quote extraction for modular journalism (JournalismAI collab 2021)☆226Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆255Updated 6 months ago
- Scripts that clean up OCR and munge Hathi metadata.☆75Updated 7 years ago
- Tutorials for working with Library of Congress collections data☆195Updated this week
- Official syllabus and course materials for English 184E: “Literary Text Mining” (Spring 2019)☆18Updated 4 years ago
- Python 3 library for processing historical English☆64Updated 5 months ago
- Python tools for performing various operations on ALTO XML files☆40Updated last year
- Text analysis with networks.☆286Updated 8 months ago
- Page to PAGE Layout Analysis Tool☆191Updated 3 years ago
- A hands-on activity in linking and enriching geo-data, part of the Linked Pasts conference☆14Updated 3 years ago
- High-performance text aligner for large collections of texts☆46Updated 2 months ago