A tool for detecting viruses and NSFW material in WARC files
☆18Jun 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for warc-safe
Users that are interested in warc-safe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Jan 3, 2024Updated 2 years ago
- Repository for revision of PREMIS OWL ontology group☆13May 12, 2022Updated 4 years ago
- Django app for managing PREMIS Events☆14Apr 28, 2026Updated 2 months ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba…☆13May 15, 2026Updated last month
- Digital preservation policies and strategies☆12Mar 29, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Nov 21, 2025Updated 7 months ago
- A whirlwind tour of Common Crawl's data using Python☆45Jun 15, 2026Updated 2 weeks ago
- Web archive index server based on RocksDB☆43Jun 8, 2026Updated 3 weeks ago
- Documentation for the Site Scanning program☆20Jun 17, 2026Updated 2 weeks ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆98Apr 22, 2025Updated last year
- XML Schema for Digital Forensics XML☆35Feb 7, 2025Updated last year
- ☆17Nov 26, 2024Updated last year
- Selected code and data for The Online Books Page and related applications☆11Jun 1, 2026Updated last month
- Tools to analyze web archives☆20Jul 12, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Single server/laptop grade file-observatory☆10Mar 30, 2023Updated 3 years ago
- Find old YouTube gems that the algorithm hides.☆28Sep 14, 2025Updated 9 months ago
- Python binding for gumbo-parser using Cython☆14Aug 16, 2016Updated 9 years ago
- Command line $MFT record decoder☆12May 20, 2017Updated 9 years ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆20Jul 11, 2025Updated 11 months ago
- How Media Cloud approaches extracting metadata from online news stories☆17Apr 15, 2026Updated 2 months ago
- Loader software for automated imaging of optical media with Nimbie disc robot☆37Apr 28, 2026Updated 2 months ago
- ☆10Apr 28, 2025Updated last year
- ☆38Jun 16, 2026Updated 2 weeks ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A persistent repository for PRONOM Research Week activities☆12May 26, 2021Updated 5 years ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- High Availability Shared Pipeline Engine☆17Sep 15, 2023Updated 2 years ago
- Digital Forensics Essentials (DFE)☆14Mar 18, 2024Updated 2 years ago
- Targeted PDFs demonstrating commonly seen PDF differentials and interoperability issues☆15Mar 20, 2026Updated 3 months ago
- Demo app built using AngularJS with Backand serving as the back end☆13Mar 1, 2017Updated 9 years ago
- Library for Object Linking and Embedding (OLE) data types☆12Jun 24, 2026Updated last week
- Illuminating the scope and content of a digital text collections☆13Jul 28, 2015Updated 10 years ago
- Presenting the Danish National Archives' Concept Model for Development of Preservation Plans.☆15May 16, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A classifier for detecting soft 404 pages☆17Sep 10, 2022Updated 3 years ago
- 🔬Experimental Minio (S3) Gateway for iRODS 💾☆12Aug 13, 2019Updated 6 years ago
- mutant standard style graphics of real trains☆15Aug 19, 2024Updated last year
- WASAPI data transfer APIs☆50Apr 23, 2022Updated 4 years ago
- Explore and analyze large datasets of images☆28Jun 19, 2026Updated 2 weeks ago
- File detector, metadata collector and well-formedness checker tool☆18Jun 3, 2026Updated last month
- Command-line tool for calculating the number of days between given dates: days until, days since, days from☆11Feb 23, 2024Updated 2 years ago