WebCuratorTool / webcurator
The root of the webcurator tool project, containing all modules needed to run a fully functional webcurator tool.
☆2Updated this week
Related projects: ⓘ
- simple script to convert web resources to a single warc file☆18Updated last year
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives☆13Updated 3 years ago
- CDXJ Indexing of WARC/ARCs☆21Updated 3 months ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆49Updated 2 months ago
- CLI implementation of httpreserve that can test links and retrieve internet archive replacements☆9Updated last year
- Awesome list dedicated to digital and data preservation tools, sources, services and so on.☆19Updated last year
- Webrecorder Automated In-Page Behavior Framework☆12Updated 3 years ago
- Specification for authentication and creating signed WACZ Files☆9Updated 2 years ago
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆42Updated last year
- A social media open post web archiving tool☆25Updated 3 months ago
- A Rails engine supporting the discovery of web archives.☆48Updated last year
- The ArchiveWeb.page Site☆27Updated 3 months ago
- Comparing warc files☆14Updated 5 years ago
- An Awesome List for getting started with web archiving☆17Updated 5 years ago
- Scripts for Internet Archive☆12Updated 4 years ago
- Web archive index server based on RocksDB☆31Updated last week
- Digital Preservation of HTTP in documentary heritage.☆22Updated last year
- Proxies third-party PDF files and HTML pages with the Hypothesis client embedded, so you can annotate them☆19Updated this week
- ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file.☆13Updated 3 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆13Updated 2 months ago
- A Github Action for turning Markdown into ReSpec HTML☆13Updated 3 months ago
- Static Site Generator for Viewing Web Archives (in WACZ) format☆20Updated last year
- Converts WARC files to static HTML☆38Updated 2 months ago
- Parse OCR result files for pagenos, tables of contents, etc.☆14Updated 12 years ago
- Selected code and data for The Online Books Page and related applications☆10Updated 2 weeks ago
- A list of things related to software, literature, and other content for 🕣 Memento☆85Updated 3 months ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆26Updated 7 months ago
- Deep Zoom Image Downloader☆17Updated 4 months ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆14Updated 3 years ago
- A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects☆47Updated 4 years ago