staffanm / ferenda
Transform unstructured document collections to structured Linked Data
☆27 · Updated last year
Related projects
Alternatives and complementary repositories for ferenda
- Adding links to full text in Wikipedia references ☆37 · Updated 10 months ago
- A PDF classifier ensemble with REST API service ☆23 · Updated 3 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms. ☆24 · Updated 2 years ago
- An index data structure for approximate string search. ☆23 · Updated 5 years ago
- CSV on the web ☆37 · Updated 3 weeks ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification. ☆23 · Updated 9 months ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java) ☆29 · Updated last year
- Ergonomic line-by-line transcription of scanned text. ☆47 · Updated 3 years ago
- Small Python library to validate persistent identifiers used in scholarly communication. ☆28 · Updated this week
- Data Package reader for Pandas ☆19 · Updated last year
- CSV inspection ☆10 · Updated last year
- Free-form web data notebook - "Data management for little guys" ☆25 · Updated last year
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea ☆13 · Updated 8 years ago
- Example SPARQL queries, mostly for working with ZBW data sets ☆15 · Updated 2 months ago
- Python language parser for a tabular format for structured metadata. http://metatab.org ☆17 · Updated last year
- Provide partial dates and retain the date precision through processing ☆13 · Updated last year
- Linked SDMX ☆17 · Updated 10 years ago
- Loading OpenSanctions into Neo4J and Linkurious ☆27 · Updated last month
- Rails application to support the Sloan Dash grant project for self-deposit submission of scholarly works. ☆16 · Updated 5 years ago
- Binary Python bindings for poppler utils for content extraction ☆42 · Updated 3 years ago
- Get the scholarly citation for any research product: software, preprint, paper, or dataset ☆69 · Updated last year
- OpenRefine reconciler for Research Organization Registry ☆13 · Updated last year
- Sidewall is a Python library for interacting with the Dimensions search API. ☆17 · Updated 2 months ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description… ☆11 · Updated last year
- Web hub based on Wikidata ☆36 · Updated last year
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets. ☆11 · Updated 9 years ago
- Codemeta paper. ☆10 · Updated 7 years ago
- A project to coordinate implementing a system to signal whether references cited on Wikipedia are free to reuse ☆19 · Updated 7 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats) ☆24 · Updated 2 years ago