matejbasic/PythonScrapyBasicSetup

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/matejbasic/PythonScrapyBasicSetup)

matejbasic / PythonScrapyBasicSetup

Basic setup with random user agents and IP addresses for Python Scrapy Framework.

☆56

Alternatives and similar repositories for PythonScrapyBasicSetup

Users that are interested in PythonScrapyBasicSetup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

IaroslavR / scrapy-mysql-pipeline
View on GitHub
scrapy mysql pipeline
☆49Jan 15, 2022Updated 4 years ago
wemake-services / wemake-django-rest
View on GitHub
Create Django REST APIs the right way, no magic intended
☆11Dec 8, 2022Updated 3 years ago
mmas / docker-scrapy-tor
View on GitHub
Scrapy environment with Tor for anonymous ip routing and Privoxy for http proxy
☆20Jul 5, 2016Updated 10 years ago
stummjr / scrapy-fieldstats
View on GitHub
A Scrapy extension to log items coverage when the spider shuts down
☆18Apr 11, 2020Updated 6 years ago
alecxe / scrapy-fake-useragent
View on GitHub
Random User-Agent middleware based on fake-useragent
☆688Sep 18, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
qwefgh90 / SeleniumSample
View on GitHub
a set of samples about Login & Cookie with PhantomJS
☆14Jul 30, 2015Updated 10 years ago
miku-sama / shiraboot
View on GitHub
一个booter网页端,基友半路甩锅的项目.
☆10Feb 8, 2017Updated 9 years ago
KayneWest / DeepSpeech
View on GitHub
project trying to replicate http://arxiv.org/pdf/1412.5567v2.pdf
☆12Mar 22, 2015Updated 11 years ago
scrapinghub / scrapy-training
View on GitHub
Scrapy Training companion code
☆173Jan 30, 2019Updated 7 years ago
Tai7sy / ico-spider
View on GitHub
ICO Source Spider, write in NodeJS
☆12May 4, 2018Updated 8 years ago
Germey / ScrapyTutorial
View on GitHub
Scrapy Tutorial
☆11Feb 19, 2017Updated 9 years ago
sky-ecosystem / spells-kovan
View on GitHub
☆12Sep 1, 2021Updated 4 years ago
etherceo1x1 / codes
View on GitHub
BUILD YOUR OWN BLOCKCHAIN: A PYTHON TUTORIAL Download the full Jupyter/iPython notebook from Github here Build Your Own Blockchain – The…
☆19Jun 15, 2019Updated 7 years ago
android-hacker / WechatLuckyMoney
View on GitHub
WechatLuckyMoney(微信红包插件)
☆10Feb 12, 2018Updated 8 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
edsonlb / botScraping
View on GitHub
(Python) Collect data directly from online retails stores for public data mining process.
☆48Jul 18, 2015Updated 11 years ago
rdcprojects / scrapy-mq-redis
View on GitHub
A RabbitMQ/Redis tool for Scrapy
☆13Oct 7, 2016Updated 9 years ago
ComputationalFinanceTools / fst-cuda-option-pricing
View on GitHub
Pricing European and American options with jump models using CUDA on the GPU
☆12Apr 12, 2016Updated 10 years ago
Gingerbreadfork / Cutlery
View on GitHub
Python Script for Copywriters to Gather Data from Competing Content and Find Keyword Overlap
☆15Apr 23, 2022Updated 4 years ago
haseemajaz / Google-Indexing-API-Publisher
View on GitHub
Python script designed to simplify the process of submitting URLs to Google's Indexing API for faster and more efficient website indexing…
☆12Sep 12, 2023Updated 2 years ago
shaunvxc / jsonlike
View on GitHub
little hack for when json.loads() complains
☆12Jul 29, 2017Updated 8 years ago
schemaorg / sdopythonapp
View on GitHub
Original schema.org python-appengine codebase
☆19Apr 10, 2022Updated 4 years ago
Atohallan / chatgpt-ui
View on GitHub
A ChatGPT web client that supports multiple users, multiple languages, and multiple database connections for persistent data storage. Pro…
☆13May 19, 2023Updated 3 years ago
teal33t / poopak
View on GitHub
POOPAK - TOR Hidden Service Crawler
☆140Nov 15, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kendricktan / ethtxd
View on GitHub
Ethereum Tx Decoder
☆15Aug 17, 2021Updated 4 years ago
ZeframLou / mev-token
View on GitHub
☆14May 29, 2021Updated 5 years ago
jamboree / act
View on GitHub
ASIO Cooperative Task for await-based coroutine
☆16Sep 8, 2018Updated 7 years ago
thomasballinger / talkingtoothercomputers
View on GitHub
☆19Jul 26, 2017Updated 8 years ago
povilasb / scrapy-html-storage
View on GitHub
Scrapy downloader middleware that stores response HTMLs to disk.
☆18Apr 14, 2026Updated 3 months ago
artistic709 / ImpermanentGain
View on GitHub
The antiparticle of impermanent loss
☆15May 23, 2022Updated 4 years ago
scrapy-plugins / scrapy-zyte-smartproxy
View on GitHub
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
☆363May 4, 2026Updated 2 months ago
pablonlr / arbibot
View on GitHub
Arbitrage bot to get profits from multiple exchanges (cex and dex) developed in Go
☆12Sep 11, 2021Updated 4 years ago
IlyasHabeeb / Machine_Learning_Focused_Crawler
View on GitHub
A focused web crawler that uses Machine Learning to fetch better relevant results.
☆13Jan 12, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ali-alharthi / WP-Attacker
View on GitHub
In a simple word, it is an "Automatic WP Exploiter".
☆17Dec 26, 2016Updated 9 years ago
scrapinghub / scrapy-poet
View on GitHub
Page Object pattern for Scrapy
☆127Jun 8, 2026Updated last month
passionweb-manuel-schnabel / ai-seo-helper
View on GitHub
Generates SEO metadata based on content using AI. Currently several metadata for pages and articles of EXT:news can be generated using an…
☆17Sep 9, 2025Updated 10 months ago
plandes / mednlp
View on GitHub
Medical natural language parsing and utility library
☆15Dec 10, 2025Updated 7 months ago
JoelNiklaus / LegalDatasets
View on GitHub
This repository serves as a collection of scrapers procuring and structuring various legal datasets
☆19Jun 16, 2023Updated 3 years ago
queryverse / ParquetFiles.jl
View on GitHub
FileIO.jl integration for Parquet files
☆19Jul 26, 2022Updated 3 years ago
akinorioyama / live-caption-analytics
View on GitHub
Send captions to servers to perform analytics and return feedback to the sender
☆15Jan 7, 2025Updated last year