metawarc: a command-line tool for metadata extraction from files from WARC (Web ARChive)
☆35Oct 27, 2025Updated 5 months ago
Alternatives and similar repositories for metawarc
Users that are interested in metawarc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Decentralized web archiving☆20Aug 7, 2018Updated 7 years ago
- A list of hashtags that bots automatically retweet. Use them to increase the reach of your tweets and increase the number of followers on…☆16Dec 13, 2021Updated 4 years ago
- Crawler that retrieves commoncrawl's crawled hosts and their corresponding IPs☆21Sep 1, 2025Updated 7 months ago
- A UserScript to detect GPT generated comments on Hackernews.☆13Dec 10, 2022Updated 3 years ago
- Simple, fast dictionary-based language detector for short texts.☆20Feb 5, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Napkin is a simple tool to produce statistical analysis of a text☆12Feb 25, 2024Updated 2 years ago
- genAI agent providing security context, tooling for performing security analysis on CVE, components and more☆22Updated this week
- html,css☆12Sep 24, 2021Updated 4 years ago
- List of all pastebin.com analogs I know of. They are useful for finding leaked personal data☆22Jul 18, 2021Updated 4 years ago
- Google's list of Certificate Transparency logs as a rust crate for use with sct.rs☆14Feb 17, 2023Updated 3 years ago
- Cross-platform CLI tool to make remote command execution in AWS a breeze☆13Feb 25, 2023Updated 3 years ago
- the source code that powered gitlive.net☆11Feb 12, 2016Updated 10 years ago
- Template for new OSINT command-line tools☆75Nov 25, 2024Updated last year
- ☆10Jan 29, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Passivedns monitor implementation in Rust.☆12Apr 21, 2016Updated 9 years ago
- A curated blocklist of Autonomous System Numbers (ASNs) associated with VPN providers, datacenters, and hosting services commonly used fo…☆17Mar 11, 2026Updated last month
- External twitter feeder for AIL framework☆16Apr 16, 2023Updated 2 years ago
- TikTok Scraper. Download video posts, collect user/trend/hashtag/music feed metadata, sign URL and etc.☆46Dec 19, 2021Updated 4 years ago
- Proof of Concept OSINT visualization☆12Dec 29, 2017Updated 8 years ago
- s3 as a datastore: A way to use S3 as a key-value datastore instead of a real datastore. can be read as s3aadatastore☆14Mar 16, 2023Updated 3 years ago
- A tool for collection archival slivers of the web and web archives☆17Feb 18, 2025Updated last year
- ☆24Mar 12, 2025Updated last year
- An IIIF Universe for IIIF catalogs☆27Apr 8, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- OSINT: come iniziare. Strumenti e idee per raccogliere e analizzare fonti aperte.☆11Mar 28, 2021Updated 5 years ago
- Capture a URL with Playwright☆31Updated this week
- Base45☆22Feb 20, 2026Updated last month
- IIIF experiments with Gallica content☆31Nov 16, 2025Updated 4 months ago
- Python SDK and CLI utility for searchcode.com.☆10Updated this week
- CS:GO Config is an updateable team config with support for personal settings.☆10Jul 22, 2022Updated 3 years ago
- Network scan tool for host and service discovery. Written in Rust.☆22Feb 17, 2026Updated last month
- CyCAT.org API back-end server including crawlers☆29Feb 4, 2023Updated 3 years ago
- Quick Cache and Archive search buttons☆38May 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A fast LuaJIT 2D, 3D, and 4D vector library☆12Nov 2, 2018Updated 7 years ago
- Lua with more or less typing. You type less; we type check.☆14Jan 1, 2026Updated 3 months ago
- Twitch Streamer Analysis, see Twitchverse https://towardsdatascience.com/twitchverse-a-network-analysis-of-twitch-universe-using-neo4j-gr…☆18Oct 25, 2024Updated last year
- A social media analytical tool that provides the useful insights and meaningful stats that helps in the enhancing your social media prese…☆10Oct 21, 2020Updated 5 years ago
- Open source & free software that measures the compliance of any webpage with the #GDPR by analyzing its source code and its behaviour. #p…☆17Dec 8, 2022Updated 3 years ago
- Converts binary files of 1C (1CD, cf, epf, efd, etc.) to grepable CSV☆13Feb 12, 2024Updated 2 years ago
- A collection of cyberchef recipes for use in osint investigations☆14Jul 2, 2022Updated 3 years ago