blues-lab / polipyLinks
Library for scraping, parsing, and analyzing privacy policies.
☆17Updated 2 years ago
Alternatives and similar repositories for polipy
Users that are interested in polipy are comparing it to the libraries listed below
Sorting:
- Toolchain to retrieve and parse privacy policies from websites as described in our paper "Unifying Privacy Policy Detection" by Henry Hos…☆17Updated 6 months ago
- Tools to construct and process Common Crawl webgraphs☆99Updated last week
- Statistics of Common Crawl monthly archives mined from URL index files☆193Updated last week
- A list of over 5000 US news domains and their social media accounts☆45Updated 2 years ago
- Common crawl extractor☆80Updated last year
- Detect communities in legal networks☆12Updated 9 months ago
- A browser extension to collect social media data with.☆289Updated this week
- ☆73Updated last week
- Pushshift Telegram Ingest☆86Updated 6 years ago
- A database of courts, tests and other experiments☆93Updated this week
- Given a set of URLs, this packages detects coordinated link sharing behavior on social media and outputs the network of entities that per…☆75Updated last year
- A Python Wrapper To Retrieve Data From The CrowdTangle API☆11Updated 4 months ago
- A helper library full of URL-related heuristics.☆71Updated 2 weeks ago
- 🌬️urlExpander is a Python package for expanding shortened links (urls).☆76Updated 3 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆141Updated 2 months ago
- CIB Analysis Data☆20Updated 5 years ago
- A small command line tool and set of functions for studying coordination networks in Twitter and other social media data.☆80Updated 2 years ago
- Frontend component for Hoaxy, a tool to visualize the spread of claims and fact checking☆72Updated 3 years ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆185Updated this week
- Run information flow experiments on the Web☆39Updated 4 years ago
- A database of court reporters, tests and other experiments☆114Updated this week
- A Python module for clustering creators of social media content into networks☆73Updated 3 years ago
- An automated, programming-free web scraper for interactive sites☆111Updated 2 years ago
- Find legal citations in any block of text☆174Updated last week
- Data model and processing tools for investigative entity data☆247Updated this week
- Tools for conducting and parsing web search☆48Updated 3 months ago
- A verification “Swiss army knife” helping journalists, fact-checkers, and human rights defenders to save time and be more efficient in th…☆40Updated this week
- Create a directed network of Twitter followers.☆70Updated 3 years ago
- The AI Incident Database seeks to identify, define, and catalog artificial intelligence incidents.☆205Updated this week
- Comprehensive database of ratings for 11k news domains☆27Updated 2 years ago