Scifabric / pdftranscribe
A simple PDF transcription project for PyBossa
☆19Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for pdftranscribe
- Python client library for controlling Google Refine☆83Updated 7 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 2 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 8 years ago
- Repository for the DMPTool Project☆36Updated 6 years ago
- Documents for the project Libraccess☆13Updated 9 years ago
- 💠 + 📚 OpenRefine on Binder!☆40Updated 5 months ago
- CoVE is an web application to Convert, Validate and Explore data following certain open data standards - including 360Giving, Open Contra…☆43Updated last month
- Adding links to full text in Wikipedia references☆37Updated 10 months ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Create and validate Data Packages in the browser☆27Updated 2 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- Prototype SOLR-powered web archive exploration UI.☆43Updated 4 years ago
- A design prototype for DocNow to learn with☆14Updated 7 years ago
- New generation DH curation and visualization platform☆10Updated last month
- A place to collect and share knowledge about liberating data from PDFs☆53Updated 2 years ago
- a CLI suggestion tool for Wikidata entities☆29Updated 8 years ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java)☆29Updated last year
- Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML☆36Updated 10 months ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p…☆22Updated 9 years ago
- Tools for text tokenization and encoding☆84Updated 3 years ago
- Free-form web data notebook - "Data management for little guys"☆25Updated last year
- Adds the ability to transcribe items using the Scripto library.☆17Updated 4 months ago
- Django web application to display, annotate, and export digitized books.☆29Updated last week
- Library of Congress coding standards☆29Updated 5 months ago
- No longer maintained. Please use conciliator instead.☆26Updated 4 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 9 years ago
- The DPLA Platform☆64Updated 6 years ago
- The DDI Discovery Vocabulary, an RDF vocabulary for data description and discovery based on DDI☆25Updated last year
- a gauge widget to display wikipedia activity☆41Updated 6 years ago