internetarchive / sandcrawler

Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki
25Updated 6 months ago

Alternatives and similar repositories for sandcrawler:

Users that are interested in sandcrawler are comparing it to the libraries listed below