metawarc: a command-line tool for metadata extraction from files from WARC (Web ARChive)
☆34Oct 27, 2025Updated 8 months ago
Alternatives and similar repositories for metawarc
Users that are interested in metawarc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tool for automated processing of disk images in BitCurator☆26Apr 13, 2026Updated 2 months ago
- A list of hashtags that bots automatically retweet. Use them to increase the reach of your tweets and increase the number of followers on…☆16Dec 13, 2021Updated 4 years ago
- Crawler that retrieves commoncrawl's crawled hosts and their corresponding IPs☆21Sep 1, 2025Updated 10 months ago
- A collection of data fetchers, and simple quarterly and yearly CVE forecasting models.☆50Oct 1, 2025Updated 9 months ago
- DomainsProject.org HTTP worker☆25Dec 11, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A UserScript to detect GPT generated comments on Hackernews.☆13Dec 10, 2022Updated 3 years ago
- genAI agent providing security context, tooling for performing security analysis on CVE, components and more☆30Jun 27, 2026Updated last week
- Simple, fast dictionary-based language detector for short texts.☆21Feb 5, 2026Updated 4 months ago
- Napkin is a simple tool to produce statistical analysis of a text☆12Feb 25, 2024Updated 2 years ago
- Quick and dirty date parsing Python library to parse HTML dates really fast☆22Jan 3, 2026Updated 6 months ago
- html,css☆12Sep 24, 2021Updated 4 years ago
- the source code that powered gitlive.net☆11Feb 12, 2016Updated 10 years ago
- Passivedns monitor implementation in Rust.☆12Apr 21, 2016Updated 10 years ago
- List of websites to search for court documents in different countries☆25Jun 1, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- External twitter feeder for AIL framework☆16Apr 16, 2023Updated 3 years ago
- Proof of Concept OSINT visualization☆12Dec 29, 2017Updated 8 years ago
- TikTok Scraper. Download video posts, collect user/trend/hashtag/music feed metadata, sign URL and etc.☆48Dec 19, 2021Updated 4 years ago
- s3 as a datastore: A way to use S3 as a key-value datastore instead of a real datastore. can be read as s3aadatastore☆13Mar 16, 2023Updated 3 years ago
- CocktailParty is a data broker system based on phoenix framework☆23Apr 23, 2025Updated last year
- The Brandefense cyber threat intelligence team is always researching new threats and writing research reports. Our latest Threat Reports …☆23Oct 1, 2025Updated 9 months ago
- ☆23Mar 12, 2025Updated last year
- Base45☆22Feb 20, 2026Updated 4 months ago
- An analysis of the released data on FinCrime Files transactions as depicted on SARs.☆10Nov 26, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- IIIF experiments with Gallica content☆32Nov 16, 2025Updated 7 months ago
- Python SDK and CLI utility for searchcode.com.☆10Jun 20, 2026Updated 2 weeks ago
- CyCAT.org API back-end server including crawlers☆29Feb 4, 2023Updated 3 years ago
- Quick Cache and Archive search buttons☆38May 11, 2024Updated 2 years ago
- Twitch Streamer Analysis, see Twitchverse https://towardsdatascience.com/twitchverse-a-network-analysis-of-twitch-universe-using-neo4j-gr…☆18Oct 25, 2024Updated last year
- A social media analytical tool that provides the useful insights and meaningful stats that helps in the enhancing your social media prese…☆11Oct 21, 2020Updated 5 years ago
- Open source & free software that measures the compliance of any webpage with the #GDPR by analyzing its source code and its behaviour. #p…☆17Dec 8, 2022Updated 3 years ago
- Here you find the complete list of enrichments and extractionsfor Ubikron.☆38Mar 6, 2026Updated 3 months ago
- A collection of cyberchef recipes for use in osint investigations☆14Jul 2, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Converts binary files of 1C (1CD, cf, epf, efd, etc.) to grepable CSV☆14Feb 12, 2024Updated 2 years ago
- Tools and resources that may be useful to you when conducting investigations related to Islamic Republic of Iran☆23Sep 10, 2025Updated 9 months ago
- this project can extract contact email address from many site.☆12Sep 26, 2021Updated 4 years ago
- Certificate Transparency Log client suitable for monitoring, quick SCT validation, gossiping, etc.☆24Feb 13, 2021Updated 5 years ago
- A Python implementation of our efficient Bloom filter library.☆29Feb 27, 2020Updated 6 years ago
- Collections of services for search data from passengers lists and emigrants records☆12Jun 3, 2022Updated 4 years ago
- Pythonic way to work with the warning lists defined there: https://github.com/MISP/misp-warninglists☆36May 11, 2026Updated last month