flaiming / Domain-Parking-SensorsLinks
Extracts features from web pages to determine whether the domain is parked
☆14Updated 3 years ago
Alternatives and similar repositories for Domain-Parking-Sensors
Users that are interested in Domain-Parking-Sensors are comparing it to the libraries listed below
Sorting:
- The original dataset for my 2013 article on Twitter's network patterns☆50Updated 8 years ago
- A list of over 5000 US news domains and their social media accounts☆45Updated 2 years ago
- List of entity resolution software and resources.☆75Updated 4 months ago
- scraper for facebook, gab, google and tiktok☆21Updated last week
- ☆16Updated last year
- 👀 Analyze Websites and Resources They Request☆23Updated 6 years ago
- A helper library full of URL-related heuristics.☆69Updated 3 weeks ago
- Statistical WHOIS parser☆10Updated 8 years ago
- Now included in rigour☆151Updated last month
- A repository demonstrating the use of real-estate-scrape to store the estimated value of a property on Redfin and Zillow every night usin…☆34Updated this week
- R package for working with data stored within VERIS framework☆13Updated 9 years ago
- Python hashlib-like wrapper for several fuzzy hash algorithms.☆14Updated 2 years ago
- 🌬️urlExpander is a Python package for expanding shortened links (urls).☆74Updated 2 years ago
- CommonCrawl keyword scanner. Time for month of CC data on EC2 c5.18xlarge instance for hundreds of keywords takes about 3 hours. LLM (BER…☆15Updated 2 years ago
- LinkedinScrapper searches linkedin pages using executive’s name, title, firm and get his / her best matching linked profile and extracts …☆9Updated 8 years ago
- Capture a URL with Playwright☆30Updated last week
- Links to resources on misinformation, disinformation, fake news, whatever it's called this week☆52Updated 3 years ago
- A semantic food search web application built with Django, Solr, SBERT, and Docker☆10Updated 2 months ago
- A command line tool to cluster html pages based on structural and style similarity.☆20Updated 2 weeks ago
- p0f v3 with impersonation spoofing, written in Python - Accurately guess the OS of a packet with passive fingerprinting.☆61Updated last year
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 4 years ago
- A maximum-strength name parser for record linkage.☆37Updated last week
- Toolchain to retrieve and parse privacy policies from websites as described in our paper "Unifying Privacy Policy Detection" by Henry Hos…☆17Updated 2 months ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆62Updated this week
- Given a set of URLs, this packages detects coordinated link sharing behavior on social media and outputs the network of entities that per…☆75Updated 10 months ago
- Pre-built template for using newspaper3k on aws lambda☆17Updated 2 years ago
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆86Updated this week
- Search for PII in Python☆29Updated last year
- An R package for implementing augmented network log anomaly detection procedures☆22Updated 5 years ago