Scrapy spider middleware to clean up query parameters in request URLs
☆24Jun 30, 2016Updated 9 years ago
Alternatives and similar repositories for scrapy-querycleaner
Users that are interested in scrapy-querycleaner are comparing it to the libraries listed below
Sorting:
- Scrapy spider middleware to split an item into multiple items using a multi-valued key☆21Feb 8, 2017Updated 9 years ago
- A scrapy extension to store requests and responses information in storage service☆27Mar 11, 2022Updated 3 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆57Mar 16, 2022Updated 3 years ago
- ☆19Oct 12, 2016Updated 9 years ago
- ☆29Apr 28, 2021Updated 4 years ago
- small fastcdc implementation in c99☆18Dec 31, 2022Updated 3 years ago
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls☆277Feb 26, 2025Updated last year
- A simple algorithm for clustering web pages, suitable for crawlers☆35Mar 6, 2017Updated 8 years ago
- ☆10Aug 2, 2019Updated 6 years ago
- An efficient simhash implementation for python☆127Oct 25, 2019Updated 6 years ago
- A scrapy extension to sync `.scrapy` folder to an S3 bucket☆18Mar 28, 2022Updated 3 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schema☆45Mar 26, 2021Updated 4 years ago
- cli for evaluating css and xpath selectors☆29Jul 4, 2023Updated 2 years ago
- Paginating the web☆37Feb 11, 2014Updated 12 years ago
- Common humanization utilities for Flask applications☆29Jan 20, 2022Updated 4 years ago
- Applicativo per la gestione e l'ottimizzazione degli acquisti dei Gruppi di acquisto Solidali (G.A.S.)☆12Oct 21, 2018Updated 7 years ago
- MongoDB extensions for Scrapy☆44Oct 2, 2014Updated 11 years ago
- Scrapy extension to control spiders using JSON-RPC☆300Aug 26, 2019Updated 6 years ago
- ☆143Nov 24, 2015Updated 10 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40May 21, 2024Updated last year
- Turkish Lemmatizer is used for finding root form of Turkish words.☆12Nov 30, 2013Updated 12 years ago
- ☆12Sep 4, 2021Updated 4 years ago
- A simple shell script with wizard to get you OpenWRT for Proxmox.☆11Oct 16, 2021Updated 4 years ago
- A Simple tool to get good and checked proxies.☆11Sep 1, 2021Updated 4 years ago
- django-filter for MongoEngine☆13Jul 12, 2024Updated last year
- personal synchronization application - based on git☆17Apr 6, 2012Updated 13 years ago
- ☆13Jan 5, 2023Updated 3 years ago
- Our Game Engine☆10Dec 1, 2016Updated 9 years ago
- ☆12Mar 21, 2012Updated 13 years ago
- use pusher☆27Jul 20, 2014Updated 11 years ago
- small tornado project of an imageboard with very bad and outdated code. use branch develop☆10Dec 8, 2022Updated 3 years ago
- Next generation linbo☆12Jan 31, 2026Updated last month
- Excel to Json online converter made with Python/Flask and React.js☆12Jan 4, 2023Updated 3 years ago
- Stor2rrd Grafan monitoring☆12Jan 8, 2019Updated 7 years ago
- Generates a YouTube playlist from a list of URLs.☆10Aug 14, 2023Updated 2 years ago
- ☆10Nov 18, 2021Updated 4 years ago
- Tutorial on how to create a twitter bot that replied to mentions☆10Sep 16, 2023Updated 2 years ago
- Dedup and compress your device mapper devices. Works especially well with thin provisioning.☆10Dec 4, 2025Updated 3 months ago
- 通过Shizuku授权,实现修改部分系统设置项。☆17Apr 1, 2024Updated last year