sarahalang / LLM-powered-OCR-correctionLinks
This repo contains files downloaded from Transkribus with corresponding suggested OCR improvements (performed using ChatGPT AI).
☆13Updated 5 months ago
Alternatives and similar repositories for LLM-powered-OCR-correction
Users that are interested in LLM-powered-OCR-correction are comparing it to the libraries listed below
Sorting:
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆54Updated 2 years ago
- A Mashup Interface for Text Analysis Operations☆13Updated 6 months ago
- A collection of open source tools and resources related to Wikibase knowledge graphs☆72Updated last year
- A deep learning architecture for reference mining from literature in the arts and humanities.☆16Updated 5 years ago
- ☆49Updated 11 months ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated 9 months ago
- Ricgraph - Research in context graph☆29Updated last week
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- A collection of notebooks for Natural Language Processing☆25Updated 6 months ago
- Named Entity Recognition API used by TEI Publisher☆20Updated last year
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 3 years ago
- ☆32Updated 2 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- Knowledge graph construction: Fast inserts into a Wikibase instance☆45Updated 3 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆30Updated 2 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆70Updated last month
- A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis.☆23Updated 3 months ago
- Ergonomic line-by-line transcription of scanned text.☆53Updated 4 years ago
- example of using RDFlib to take a CSV and make triples from it☆26Updated 7 years ago
- Pytorch implementation of a BiLSTM model for the Wikification project.☆19Updated 5 years ago
- Libraries, Archives and Museums (LAM)☆84Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆18Updated 11 months ago
- OpenAtlas is an open source, web based database system for complex archaeological, historical and prosopographical data.☆27Updated last week
- ☆55Updated last year
- Named entity recognition for the legal domain☆42Updated 4 years ago
- MeMAD multimodal content analysis and machine translation: collection of tools and libraries☆12Updated 4 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Jupyter notebooks with examples of querying different PID graphs and providers like OpenAlex, FREYA PID Graph, OpenAIRE, ORCID, ROR, Cros…☆24Updated 2 years ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆23Updated 2 years ago