shavit / crawlero

Distributed web crawlers. Fault tolerance, user-agent randomizer, RabbitMQ, Tor, PostgreSQL.

☆16

Alternatives and similar repositories for crawlero:

Users that are interested in crawlero are comparing it to the libraries listed below

carlsednaoui / google-seo-allintitle-scraper
Google SEO scraper for "allintitle:keyword" queries.
☆23Updated 10 years ago
matoous / gosmtp
Golang implementation of full-featured, lightweight and RFC compliant SMTP server.
☆10Updated 4 years ago
ChrisCates / CommonCrawler
🕸 A simple way to extract data from Common Crawl
☆34Updated 5 years ago
mchmarny / tsignal
Analyzing social media sentiment and its impact on stock market
☆39Updated last year
machinebox / textbox_elastic_indexer
Example how to pre-process news articles with textbox and index on Elastic Search
☆13Updated 7 years ago
MontFerret / ferret-server
Advanced declarative web scraping
☆30Updated 2 years ago
EdmundMartin / SearchScraperAPI
Aiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of result…
☆56Updated last year
TeamHG-Memex / sitehound-frontend
Site Hound (previously THH) is a Domain Discovery Tool
☆23Updated 3 years ago
Supme / smtpSender
build and send email
☆17Updated 2 years ago
gfjreg / CommonCrawl
A distributed system for mining common crawl using SQS, AWS-EC2 and S3
☆18Updated 10 years ago
haridas / IP-monitoring
Monitor your IP reputation for Email sending or Email marketing.
☆44Updated 11 years ago
rootVIII / proxy_web_crawler
Automates the process of repeatedly searching for a website via scraped proxy IP and search keywords
☆44Updated last year
unixpickle / essentials
Things I wish were Go built-ins
☆13Updated 4 years ago
jpillora / docker-cloud-torrent-openvpn
cloud-torrent and OpenVPN in a docker container
☆17Updated 7 years ago
uploadcare / email-list-verify
verify every email in file
☆16Updated 6 years ago
CI-Research / KeywordAnalysis
Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends
☆56Updated last year
dutchcoders / db2es
MySQL export to Elasticsearch
☆14Updated 8 years ago
lanycrost / whois-server
WHOIS server builded in Golang and using Postgres as a DB.
☆11Updated 6 years ago
CoinCulture / claw
Command Line Law - Contract management for developers | lawyers
☆25Updated 6 years ago
sangaline / email-spy
A browser extension that lets you find email addresses for any domain with a single click.
☆71Updated 7 years ago
tal95shah / LinkedIn_Scraper
A Selenium based automated program that scrapes profiles data,stores in CSV,follows them and saves their profile in PDF.
☆32Updated last year
joelparkerhenderson / big-five-personality-traits
Big Five personality traits: domains, aspects, facets
☆25Updated last year
rvalitov / backlink-checker
For SEO: checks a list of web pages if a backlink is present
☆11Updated 5 years ago
binwiederhier / elastictl
Simple tool to import/export Elasticsearch indices into a file, and/or reshard an index
☆19Updated 3 years ago
zaosoula / email-spam-domains
A list of 94,548 spam or temporary email domain
☆12Updated 4 years ago
osminogin / tornote
Self-destructing notes on Go with tiny secured client-side
☆29Updated 2 years ago
twintproject / twint-desktop-react
☆14Updated 2 years ago
schollz / googleit
Get results from search engines.
☆12Updated 2 years ago
phantombuster / sdk
Phantombuster's SDK
☆14Updated 5 months ago
robertoarruda / linkedin-public-dir-companies
Crawler and scraper of the public directory of companies on LinkedIn.
☆25Updated 5 years ago