LibraryOfCongress / newspaper-navigator
☆243Updated 3 years ago
Alternatives and similar repositories for newspaper-navigator:
Users that are interested in newspaper-navigator are comparing it to the libraries listed below
- Collection of OCR-related python tools and wrappers from @OCR-D☆128Updated last week
- Using ML to extract campaign finance data from messy forms for journalism☆76Updated 2 years ago
- Tutorials for working with Library of Congress collections data☆203Updated last month
- A Large Dataset of Historical Japanese Documents with Complex Layouts☆32Updated 2 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆144Updated 5 months ago
- Practical Approaches to Data Science with Text☆39Updated 5 years ago
- Toolbox for OCR post-correction☆121Updated 5 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆187Updated last month
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆126Updated 3 years ago
- A suite of batches and tools for OCR tasks.☆71Updated last year
- A Twitter data collection and appraisal application.☆51Updated 2 years ago
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago
- An OCR evaluation tool☆65Updated last month
- Python 3 library for processing historical English☆66Updated 7 months ago
- Master repository which includes most other OCR-D repositories as submodules☆72Updated last month
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated last year
- Document Layout Analysis☆360Updated this week
- Python tools for performing various operations on ALTO XML files☆45Updated 3 weeks ago
- OCR-D-compliant page segmentation☆67Updated 2 weeks ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆207Updated last year
- Quote extraction for modular journalism (JournalismAI collab 2021)☆227Updated 3 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆48Updated 11 months ago
- Scripts and results from our OCR roundup, available on Source☆150Updated 6 years ago
- Detect and align similar passages☆98Updated last month
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- A textual corpus database for the digital humanities.☆61Updated 4 years ago
- Distant Viewing Toolkit for the Analysis of Visual Culture☆93Updated 6 months ago
- High-performance text aligner for large collections of texts☆50Updated 4 months ago
- The repository and website hosting the peer review process for new Programming Historian lessons☆143Updated this week