A tool for detecting viruses and NSFW material in WARC files
☆17Dec 16, 2025Updated 2 months ago
Alternatives and similar repositories for warc-safe
Users that are interested in warc-safe are comparing it to the libraries listed below
Sorting:
- ☆14Jan 3, 2024Updated 2 years ago
- Digital preservation policies and strategies☆12Mar 29, 2024Updated last year
- Repository for revision of PREMIS OWL ontology group☆13May 12, 2022Updated 3 years ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba…☆13Dec 13, 2022Updated 3 years ago
- Django app for managing PREMIS Events☆14Updated this week
- Web archive index server based on RocksDB☆38Updated this week
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆19Jul 11, 2025Updated 7 months ago
- A whirlwind tour of Common Crawl's data using Python☆35Feb 17, 2026Updated 2 weeks ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆93Apr 22, 2025Updated 10 months ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- XML Schema for Digital Forensics XML☆35Feb 7, 2025Updated last year
- CDXJ Indexing of WARC/ARCs☆33Dec 10, 2024Updated last year
- ☆36Jan 21, 2026Updated last month
- Loader software for automated imaging of optical media with Nimbie disc robot☆36Mar 10, 2025Updated 11 months ago
- Imports events from remotely-located iCalendar files into The Events Calendar plugin for WordPress.☆10Jun 26, 2025Updated 8 months ago
- Description des formats de fichier☆11Feb 4, 2022Updated 4 years ago
- ☆17Feb 20, 2026Updated last week
- A manga reader plugin for KOReader that connects to Kavita self-hosted digital library.☆28Feb 11, 2026Updated 2 weeks ago
- ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 …☆96Dec 31, 2025Updated 2 months ago
- ☆14Jun 10, 2025Updated 8 months ago
- Command line tool for digging into WARC files☆51Updated this week
- OpenPGP in Python using Sequoia PGP☆18Feb 25, 2026Updated last week
- ☆10Apr 28, 2025Updated 10 months ago
- Code for preservation simulation/modeling project☆10Aug 24, 2021Updated 4 years ago
- A fork of the disktype disk and disk image format detection tool☆11Nov 16, 2016Updated 9 years ago
- This repository contains examples of XML and XSLT files that can be used to control adding/viewing/editing/indexing of metadata in Preser…☆11Jan 8, 2019Updated 7 years ago
- Application which supports the UNC Libraries' Digital Collections Repository☆12Updated this week
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 5 years ago
- This repository holds the sources for plugins that interface with the G'MIC library (http://gmic.eu)☆12Mar 4, 2023Updated 3 years ago
- Automatically generate tests for your website by using LLM models☆17Aug 7, 2023Updated 2 years ago
- SIARD (Software Independent Archiving of Relational Databases) - an open file format for the long-term archiving of relational databases☆12Nov 14, 2024Updated last year
- Carefully curated list of awesome digital preservation resources.☆125Aug 1, 2025Updated 7 months ago
- VIVOTO is an android simple video and photo editor that can remove anything that you want to remove object. In this app, you can use trim…☆11Jun 16, 2020Updated 5 years ago
- CV approach aimed to remove moving objects in videos (dynamic and static camera)☆11Mar 21, 2021Updated 4 years ago
- This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading …☆18Jan 27, 2024Updated 2 years ago
- ☆10Apr 5, 2022Updated 3 years ago
- PERICLES Extraction Tool☆17May 12, 2017Updated 8 years ago
- ☆16Nov 26, 2024Updated last year
- ☆29Oct 25, 2025Updated 4 months ago