monperrus / crawler-user-agents
Syntactic patterns of HTTP user-agents used by bots / robots / crawlers / scrapers / spiders. Pull requests welcome.
★1,285 · Updated 2 months ago
Alternatives and similar repositories for crawler-user-agents
Users that are interested in crawler-user-agents are comparing it to the libraries listed below
- 🤖/👨‍🦰 Detect bots/crawlers/spiders using the user agent string ★1,051 · Updated last week
- Daily updated repository for https://github.com/disposable/disposable ★514 · Updated this week
- An open source list of ASNs known to belong to cloud, managed hosting, and colo facilities. ★412 · Updated 5 months ago
- A list of disposable email domains ★1,348 · Updated 4 months ago
- The quickest way to run thumbor. ★292 · Updated 4 months ago
- A list of disposable/temporary email address domains ★1,149 · Updated 3 weeks ago
- binlist.net data repo ★633 · Updated last year
- Block bad, possibly even malicious web crawlers (automated bots) using Nginx ★866 · Updated 4 years ago
- Whois server list for all top level domains. ★325 · Updated 4 years ago
- Easily create XML sitemaps for your website. ★435 · Updated last year
- Community-contributed list of referrer spammers. Comment +1 in any issue or Pull request and the spammer will be added to the list! ★674 · Updated 2 weeks ago
- IP geolocation web server ★736 · Updated 3 years ago
- reliable fake and temp email filter solution for site operators ★280 · Updated this week
- Detects ad blockers (AdBlock, ...) ★1,905 · Updated last year
- Nginx Block Bad Bots, Spam Referrer Blocker, Vulnerability Scanners, User-Agents, Malware, Adware, Ransomware, Malicious Sites, with anti… ★4,442 · Updated this week
- This project aims to modify your nginx configuration to let you get the real ip address of your visitors. ★708 · Updated last year
- A list of temporary email providers ★1,133 · Updated 2 months ago
- JavaScript domain name parser based on the Public Suffix List ★418 · Updated 3 months ago
- This is a project for a browser fingerprinting technique that can track users not only within a single browser but also across different … ★1,268 · Updated 3 years ago
- Cross-language temporary (disposable/throwaway) email detection library. Covers 55 734+ fake email providers. ★1,773 · Updated this week
- Maxmind GEO Lookup ★627 · Updated 2 weeks ago
- NGINX module for Brotli compression ★2,167 · Updated last year
- Automatic PageSpeed optimization module for Nginx ★4,358 · Updated 2 years ago
- IP Intelligence is a free Proxy VPN TOR and Bad IP detection tool to prevent Fraud, stolen content, and malicious users. Block proxies, V… ★324 · Updated 5 months ago
- Apache Block Bad Bots, (Referer) Spam Referrer Blocker, Vulnerability Scanners, Malware, Adware, Ransomware, Malicious Sites, Wordpress T… ★904 · Updated this week
- Device information and digital fingerprinting written in pure JavaScript. ★2,188 · Updated 2 years ago
- Splits a hostname into subdomains, domain and (effective) top-level domains. ★513 · Updated 5 months ago
- Search Console Archive store your Search Console (Webmaster Tools) data to exceed the 90 days history in a web SEO tool with search & ana… ★31 · Updated 5 years ago
- Scrapoxy is a super proxies manager that orchestrates all your proxies into one place, rather than spreading management across multiple s… ★2,309 · Updated last month
- Rotating TOR proxy with Docker ★1,182 · Updated last year