NikolaiT/se-scraper

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NikolaiT/se-scraper)

NikolaiT / se-scraper

Javascript scraping module based on puppeteer for many different search engines...

☆571

Alternatives and similar repositories for se-scraper

Users that are interested in se-scraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NikolaiT / GoogleScraper
View on GitHub
A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.
☆2,868Jul 3, 2021Updated 5 years ago
NikolaiT / Crawling-Infrastructure
View on GitHub
Distributed crawling infrastructure running on top of severless computation, cloud storage (such as S3) and sophisticated queues.
☆438Dec 30, 2022Updated 3 years ago
NikolaiT / scrapeulous
View on GitHub
Cloud crawler functions for scrapeulous
☆44Feb 24, 2021Updated 5 years ago
NikolaiT / struktur
View on GitHub
Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.
☆70Jun 8, 2021Updated 5 years ago
NikolaiT / stealthy-scraping-tools
View on GitHub
Minimal set of tools to conduct stealthy scraping.
☆166Apr 21, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tasos-py / Search-Engines-Scraper
View on GitHub
Search google, bing, yahoo, and other search engines with python
☆670Apr 2, 2025Updated last year
ecoron / SerpScrap
View on GitHub
SEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type …
☆271Apr 21, 2026Updated 3 months ago
ncouture / python-search-engine
View on GitHub
Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh.
☆10Jun 10, 2025Updated last year
berstend / puppeteer-extra
View on GitHub
💯 Teach puppeteer new tricks through plugins.
☆7,386Jul 18, 2024Updated 2 years ago
jroakes / screaming-frog-shingling
View on GitHub
Uses Screaming Frog Internal HTML with text extraction along with a shingling algorithm to compare content duplication across the pages o…
☆47Oct 2, 2019Updated 6 years ago
FOGSEC / free-online-competitive-intelligence
View on GitHub
🎉 40+ Online FREE Competitive Intelligence Tools List
☆19Mar 8, 2018Updated 8 years ago
MarioVilas / googlesearch
View on GitHub
Google search from Python (unofficial).
☆1,251Nov 11, 2025Updated 8 months ago
sethblack / python-seo-analyzer
View on GitHub
An SEO tool that analyzes the structure of a site, crawls the site, count words in the body of the site and warns of any technical SEO is…
☆1,463Updated this week
itemsapi / elasticitems
View on GitHub
Higher level client for Elasticsearch written in Node.js oriented on facets and simplicity
☆20Aug 30, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
serp-spider / search-engine-google
View on GitHub
Google client for SERPS
☆168May 15, 2024Updated 2 years ago
paulirish / headless-cat-n-mouse
View on GitHub
Is headless chrome currently detectable? Let's pit the detections and detection evasions against eachother.
☆661Jun 5, 2021Updated 5 years ago
eliasdabbas / advertools
View on GitHub
advertools - online marketing productivity and analysis tools
☆1,423Jun 30, 2026Updated 3 weeks ago
thibauts / node-google-search-scraper
View on GitHub
Google search scraper with captcha solving support
☆88Oct 23, 2019Updated 6 years ago
istresearch / scrapy-cluster
View on GitHub
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
☆1,226Nov 7, 2023Updated 2 years ago
sonumja / NLP_ArticleSpinner
View on GitHub
Simple NLP Article Spinner algorithm
☆12Aug 30, 2018Updated 7 years ago
constverum / ProxyBroker
View on GitHub
Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS
☆4,158Mar 18, 2024Updated 2 years ago
BruceDone / awesome-crawler
View on GitHub
A collection of awesome web crawler,spider in different languages
☆7,257Jun 16, 2024Updated 2 years ago
Cuadrix / puppeteer-page-proxy
View on GitHub
Additional module to use with 'puppeteer' for setting proxies per page basis.
☆449Jun 9, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
openeventdata / UniversalPetrarch
View on GitHub
Language-agnostic political event coding using universal dependencies
☆18Jun 4, 2019Updated 7 years ago
brendonboshell / supercrawler
View on GitHub
A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con…
☆381Dec 30, 2022Updated 3 years ago
shyamupa / xelms
View on GitHub
☆19Dec 19, 2018Updated 7 years ago
clemfromspace / scrapy-puppeteer
View on GitHub
Scrapy + Puppeteer
☆110Jun 11, 2021Updated 5 years ago
scrapinghub / frontera
View on GitHub
A scalable frontier for web crawlers
☆1,332Jun 6, 2025Updated last year
jsphpl / redirect-mapper
View on GitHub
Generate a redirect map from two sitemaps for website migration.
☆13May 4, 2018Updated 8 years ago
ttlns / brotector
View on GitHub
An advanced antibot for webdrivers
☆290Dec 3, 2024Updated last year
harismuneer / Ultimate-Social-Scrapers
View on GitHub
🤖 Top-rated tools to scrape all major public sections from Facebook, Instagram, and Twitter (X) including posts (likes/comments), photos…
☆3,144Jun 7, 2025Updated last year
adamisntdead / poke
View on GitHub
A simple tool to check your site for broken links, media, iframes, stylesheets, scripts, forms or metadata.
☆25Apr 12, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sahava / web-scraper-gcp
View on GitHub
Scrape all the pages and links of a given domain and write the results to Google Cloud BigQuery.
☆39Sep 7, 2020Updated 5 years ago
scrapoxy / scrapoxy
View on GitHub
Scrapoxy has been discontinued.
☆2,415Feb 7, 2026Updated 5 months ago
DanMcInerney / search-google
View on GitHub
Scrape google search results
☆94Aug 24, 2018Updated 7 years ago
scrapinghub / article-extraction-benchmark
View on GitHub
Article extraction benchmark: dataset and evaluation scripts
☆377May 29, 2026Updated 2 months ago
intoli / user-agents
View on GitHub
A JavaScript library for generating random user agents with data that's updated daily.
☆1,184Updated this week
scrapinghub / portia
View on GitHub
Visual scraping for Scrapy
☆9,505Jun 26, 2024Updated 2 years ago
scrapinghub / spidermon
View on GitHub
Scrapy Extension for monitoring spiders execution.
☆562May 28, 2026Updated 2 months ago