emijrp / internet-archiveLinks
Scripts for Internet Archive
☆13Updated 9 months ago
Alternatives and similar repositories for internet-archive
Users that are interested in internet-archive are comparing it to the libraries listed below
Sorting:
- CDXJ Indexing of WARC/ARCs☆31Updated last year
- Perpetual Access To The Scholarly Record☆120Updated last year
- search interface for scholarly works☆85Updated last year
- Selected code and data for The Online Books Page and related applications☆11Updated 3 weeks ago
- A Memento Aggregator CLI and Server in Go☆73Updated 9 months ago
- A Rails engine supporting the discovery of web archives.☆50Updated 2 years ago
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆52Updated 3 years ago
- Distributed Proofreaders is a web application intended to ease the process of converting public domain books into e-texts.☆55Updated 2 weeks ago
- Trough: Big data, small databases.☆41Updated last year
- metawarc: a command-line tool for metadata extraction from files from WARC (Web ARChive)☆34Updated 2 months ago
- Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archive…☆26Updated 3 years ago
- Open ONI (Open Online Newspaper Initiative) Django web app☆52Updated 8 months ago
- Comparing warc files☆17Updated 6 years ago
- A set of utilities for processing MediaWiki XML dump data.☆61Updated 10 months ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆55Updated 4 months ago
- The One True Open Access Button - cross-compatible extension for research papers and data.☆48Updated last year
- Web archive index server based on RocksDB☆37Updated last week
- A Memento TimeGate☆44Updated 5 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆12Updated last year
- A collection of tools for archiving and analysing the internet.☆78Updated 3 years ago
- Tool to import files from the Internet Archive to Wikimedia Commons.☆18Updated 2 weeks ago
- Arquivo.pt main goal is the preservation and access of web contents that are no longer available online. During the developing of the PW…☆52Updated last month
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆58Updated last year
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in t…☆131Updated last month
- Tools for helping you work with web platform archive downloads.☆18Updated 5 years ago
- freeyourstuff.cc - universal content liberation☆81Updated 2 years ago
- A fun tool for quickly browsing unsourced snippets on Wikipedia.☆112Updated 3 weeks ago
- Command line tool for digging into WARC files☆49Updated 2 weeks ago
- A repository for generating PKP's documentation hub.☆17Updated this week
- A list of things related to software, literature, and other content for 🕣 Memento☆102Updated last year