metawarc: a command-line tool for extracting metadata from files stored in WARC (Web ARChive) archives
☆34 · Oct 27, 2025 · Updated 6 months ago
Alternatives and similar repositories for metawarc
Users that are interested in metawarc are comparing it to the libraries listed below.
- A list of hashtags that bots automatically retweet. Use them to increase the reach of your tweets and increase the number of followers on… ☆16 · Dec 13, 2021 · Updated 4 years ago
- ☆11 · Jul 20, 2023 · Updated 2 years ago
- Crawler that retrieves commoncrawl's crawled hosts and their corresponding IPs ☆21 · Sep 1, 2025 · Updated 8 months ago
- DomainsProject.org HTTP worker ☆25 · Dec 11, 2022 · Updated 3 years ago
- A UserScript to detect GPT generated comments on Hackernews. ☆13 · Dec 10, 2022 · Updated 3 years ago
- Simple, fast dictionary-based language detector for short texts. ☆20 · Feb 5, 2026 · Updated 3 months ago
- Napkin is a simple tool to produce statistical analysis of a text ☆12 · Feb 25, 2024 · Updated 2 years ago
- List of all pastebin.com analogs I know of. They are useful for finding leaked personal data ☆22 · Jul 18, 2021 · Updated 4 years ago
- Google's list of Certificate Transparency logs as a Rust crate for use with sct.rs ☆14 · Feb 17, 2023 · Updated 3 years ago
- The source code that powered gitlive.net ☆11 · Feb 12, 2016 · Updated 10 years ago
- Passivedns monitor implementation in Rust. ☆12 · Apr 21, 2016 · Updated 10 years ago
- List of websites to search for court documents in different countries ☆25 · Jun 1, 2022 · Updated 3 years ago
- A curated blocklist of Autonomous System Numbers (ASNs) associated with VPN providers, datacenters, and hosting services commonly used fo… ☆25 · Mar 11, 2026 · Updated 2 months ago
- External Twitter feeder for AIL framework ☆16 · Apr 16, 2023 · Updated 3 years ago
- CocktailParty is a data broker system based on the Phoenix framework ☆23 · Apr 23, 2025 · Updated last year
- ☆23 · Mar 12, 2025 · Updated last year
- OSINT: how to get started. Tools and ideas for collecting and analyzing open sources. ☆11 · Mar 28, 2021 · Updated 5 years ago
- Base45 ☆22 · Feb 20, 2026 · Updated 3 months ago
- An analysis of the released data on FinCrime Files transactions as depicted on SARs. ☆10 · Nov 26, 2020 · Updated 5 years ago
- Python SDK and CLI utility for searchcode.com. ☆10 · Updated this week
- Network scan tool for host and service discovery. Written in Rust. ☆22 · Feb 17, 2026 · Updated 3 months ago
- A tool for collecting archival slivers of the web and web archives ☆17 · Feb 18, 2025 · Updated last year
- CyCAT.org API back-end server including crawlers ☆29 · Feb 4, 2023 · Updated 3 years ago
- Quick Cache and Archive search buttons ☆39 · May 11, 2024 · Updated 2 years ago
- GraphQLmap is a scripting engine to interact with a GraphQL endpoint for pentesting purposes. - Do not use for illegal testing ;) ☆15 · Mar 11, 2024 · Updated 2 years ago
- Twitch Streamer Analysis, see Twitchverse https://towardsdatascience.com/twitchverse-a-network-analysis-of-twitch-universe-using-neo4j-gr… ☆18 · Oct 25, 2024 · Updated last year
- A collection of CyberChef recipes for use in OSINT investigations ☆14 · Jul 2, 2022 · Updated 3 years ago
- This project can extract contact email addresses from many sites. ☆12 · Sep 26, 2021 · Updated 4 years ago
- Certificate Transparency Log client suitable for monitoring, quick SCT validation, gossiping, etc. ☆24 · Feb 13, 2021 · Updated 5 years ago
- Tool to use Nmap in Flask with different types of scans. 👁 ☆14 · Mar 12, 2026 · Updated 2 months ago
- A Python implementation of our efficient Bloom filter library. ☆29 · Feb 27, 2020 · Updated 6 years ago
- Python exploit for Werkzeug debug shell command execution ☆10 · Jun 26, 2017 · Updated 8 years ago
- D4 core software (server and sample sensor client) ☆43 · Dec 23, 2023 · Updated 2 years ago
- Registry of data portals, catalogs, and data repositories, including a description standard for data catalogs and datasets ☆56 · Feb 24, 2026 · Updated 3 months ago
- Pythonic way to work with the warning lists defined there: https://github.com/MISP/misp-warninglists ☆36 · May 11, 2026 · Updated 2 weeks ago
- Convert IP addresses to emojis ☆14 · Jan 11, 2023 · Updated 3 years ago
- Automate vulnerability triage which prioritizes remediation over discovery ☆21 · May 17, 2026 · Updated last week
- Magnifier is a simple Python script for information gathering ☆43 · Jul 12, 2022 · Updated 3 years ago
- Telegram cybersecurity channels. ☆20 · Oct 27, 2025 · Updated 6 months ago