internetarchive / pdf_trioLinks
A PDF classifier ensemble with REST API service
☆23Updated 4 years ago
Alternatives and similar repositories for pdf_trio
Users that are interested in pdf_trio are comparing it to the libraries listed below
Sorting:
- MOVED to https://gitlab.com/crossref/reference_matching_evaluation_framework☆17Updated 5 years ago
- web app for visualizing Wikidata items on a timeline☆16Updated 6 years ago
- A browser extension providing Open Access bibliographical services☆17Updated 2 years ago
- Open Access PDF harvester☆40Updated last year
- Translate by annotating☆17Updated 8 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Updated 7 years ago
- IEEE Taxonomy in RDF (with Python tool for converting it from txt to rdf)☆11Updated 3 months ago
- Specification for authentication and creating signed WACZ Files☆10Updated 3 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- Scripts for Wikidata☆20Updated 2 months ago
- WASAPI data transfer APIs☆44Updated 3 years ago
- Adding links to full text in Wikipedia references☆37Updated this week
- Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archive…☆26Updated 2 years ago
- curation workflow automation and coordination☆42Updated 4 months ago
- The One True Open Access Button - cross-compatible extension for research papers and data.☆45Updated 8 months ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆26Updated 10 months ago
- DBpedia, which frequently crawls and analyses over 120 Wikipedia language editions has near complete information about (1) which facts ar…☆11Updated 2 years ago
- Digital Preservation of HTTP in documentary heritage.☆22Updated 2 years ago
- Installer for Thymeflow, a personal knowledge management system.☆33Updated 7 years ago
- Trough: Big data, small databases.☆42Updated 10 months ago
- Sort-friendly URI Reordering Transform (SURT) python module☆42Updated 10 months ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- Specifications of the reconciliation API☆34Updated last week
- Wikipedia citation tool for Google Books, New York Times, ISBN, DOI and more☆22Updated 8 years ago
- Metadata and per-statute PDFs for the U.S. Statutes at Large through volume 64 (1789-1951).☆17Updated 5 years ago
- OpenAIRE Guidelines for Literature Repository Managers based on Dublin Core and DataCite Metadata Kernel☆13Updated last year
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- Named-Entity Recognition extension for OpenRefine☆28Updated 2 years ago
- Open ONI (Open Online Newspaper Initiative) Django web app☆50Updated 2 months ago
- A deep learning architecture for reference mining from literature in the arts and humanities.☆16Updated 5 years ago