flaiming / Domain-Parking-SensorsLinks
Extracts features from web pages to determine whether the domain is parked
☆14Updated 3 years ago
Alternatives and similar repositories for Domain-Parking-Sensors
Users that are interested in Domain-Parking-Sensors are comparing it to the libraries listed below
Sorting:
- A list of over 5000 US news domains and their social media accounts☆44Updated 2 years ago
- scraper for facebook, gab, google and tiktok☆21Updated last month
- List of entity resolution software and resources.☆78Updated 5 months ago
- Query 'GreyNoise Intelligence 'API' in R☆14Updated 5 years ago
- A helper library full of URL-related heuristics.☆70Updated last month
- ☆11Updated 6 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- A maximum-strength name parser for record linkage.☆37Updated last month
- Given a set of URLs, this packages detects coordinated link sharing behavior on social media and outputs the network of entities that per…☆75Updated 11 months ago
- 🌬️urlExpander is a Python package for expanding shortened links (urls).☆75Updated 2 years ago
- Browser extension to simulate browsing behaviour in search engines.☆31Updated last week
- Tools to Obtain and Work with Cloud Provider CIDR Blocks in R☆17Updated 6 years ago
- Query the 'PublicWWW' Source Code Search Engine in R☆13Updated 7 years ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆142Updated 6 months ago
- 👀 Analyze Websites and Resources They Request☆23Updated 6 years ago
- Classifying the content of domains☆56Updated 2 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆66Updated 3 weeks ago
- Now included in rigour☆151Updated 2 months ago
- Deduplicate and parse list of `dirty names'☆23Updated 4 years ago
- Predict the Race of a Given Surname Using Census Data☆12Updated 2 years ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆75Updated last year
- A command line tool to cluster html pages based on structural and style similarity.☆20Updated 2 weeks ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 9 months ago
- 📊 Repository for the study on 11.8 Million Google Search Results☆26Updated 5 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 8 months ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆77Updated last month
- Exploring internet domain names with deep learning using vector embeddings☆20Updated 6 years ago
- Probabilistic Record Linkage Using Pretrained Text Embeddings☆14Updated 2 weeks ago
- Group thousands of similar spreadsheet or database text entries in seconds☆156Updated 2 years ago