scrapedia/scrapy-pipelines

A collection of pipelines for Scrapy

☆16

Alternatives and similar repositories for scrapy-pipelines

Users who are interested in scrapy-pipelines are comparing it to the libraries listed below.

mrt-kousha / scrapy
In this repository, I try to share some of the little tips and tricks and amazing spiders that I used to work with on the scrapy framewor…
☆12 · Feb 2, 2020 · Updated 6 years ago
scrapedia / r18
A scrapy spider for R18
☆16 · Updated this week
stummjr / scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
☆18 · Apr 11, 2020 · Updated 6 years ago
randName / scrapy-jav
JAV site scrapers
☆19 · Jul 6, 2022 · Updated 4 years ago
alias454 / graylog-zeek-content-pack
BRO/Zeek IDS content pack contains pipeline rules, a stream, a dashboard displaying interesting activity, and a syslog tcp input to captu…
☆19 · Apr 12, 2020 · Updated 6 years ago
scrapedia / scrapy-useragents
A downloader middleware to change user-agent of scrapy
☆21 · Apr 13, 2026 · Updated 3 months ago
FavyTeam / Advanced_PHP_Scrapping
Enhancement scraping API for six hotel booking websites: Expedia.com, Booking.com, Bookhotelbeds.com, Hotels.com, Bestday.com, despegar…
☆11 · May 7, 2018 · Updated 8 years ago
scravy / pysparkextra
☆10 · Jun 29, 2021 · Updated 5 years ago
mustafaaljadery / mlxcli
Run large models from the terminal using Apple MLX.
☆32 · Mar 18, 2024 · Updated 2 years ago
IshitaTakeshi / TruthFinder
TruthFinder finds true facts from a large amount of conflicting information
☆18 · May 9, 2018 · Updated 8 years ago
swami-mahesh / Media-Organizer
A library and command-line utility to organise your media (photos and videos) better
☆16 · Nov 15, 2018 · Updated 7 years ago
RdoubleA / DWIinpainting
Reconstructing cropped data in 3D diffusion MRI images using variational autoencoders
☆11 · Aug 3, 2020 · Updated 5 years ago
miha-stopar / extract-repetitions
Extract (DOM tree) repetitions from a webpage
☆11 · Jan 13, 2014 · Updated 12 years ago
pawal / torrent-tracker
Track torrent-trackers
☆20 · Oct 3, 2012 · Updated 13 years ago
ownport / aria2c-as-a-service
A collection of scripts and tools for running aria2c as a service for local install
☆14 · Apr 4, 2014 · Updated 12 years ago
jakop345 / strike-search
The awful code behind the most beautiful torrent site to ever exist.
☆11 · Jan 28, 2016 · Updated 10 years ago
schubergphilis / data-migrator
A declarative data-migration package
☆16 · Dec 7, 2024 · Updated last year
simonw / llm-templates-github
Research prototype for new register_template_loaders LLM plugin hook
☆19 · Apr 7, 2025 · Updated last year
IaroslavR / scrapy-mysql-pipeline
scrapy mysql pipeline
☆49 · Jan 15, 2022 · Updated 4 years ago
WebScrapingSolutions / scrapy-project-template
Scrapy project template. Use it to quickly spin up a new web scraping project
☆16 · Nov 18, 2024 · Updated last year
cevoaustralia / glue-vscode
Local Development of AWS Glue with Docker and Visual Studio Code
☆14 · Nov 29, 2021 · Updated 4 years ago
kyegomez / MGQA
The open source implementation of the multi grouped query attention from the paper "GQA: Training Generalized Multi-Query Transformer Model…
☆17 · Dec 11, 2023 · Updated 2 years ago
chauhan01 / Captcha-text-extraction
Extracting captcha text using CNN
☆11 · Feb 12, 2022 · Updated 4 years ago
Tiago-Lira / scrapyd-mongodb
Library designed to replace the SQLite backend with a MongoDB backend for Scrapy queue management
☆17 · Sep 2, 2017 · Updated 8 years ago
thomasleplus / gphotos-archive
Archiving tool for Google Photos
☆16 · Updated this week
xuwei517 / bigdata
Companion code for the book 大数据技术及架构图解实战派 (Big Data Technology and Architecture: An Illustrated, Hands-On Guide)
☆13 · Sep 2, 2022 · Updated 3 years ago
scrapy / pypydispatcher
A fork of http://pydispatcher.sourceforge.net/ with PyPy support
☆16 · Jul 3, 2017 · Updated 9 years ago
skhaz / piragamedev-joystick
My prototype of a USB-connected joystick that uses HID protocol, written in C using Atmega microcontrollers.
☆10 · Nov 5, 2022 · Updated 3 years ago
quantiota / AI-Agent-Host
☆33 · Jul 2, 2026 · Updated 3 weeks ago
scrapehero / walmart-coupons
Walmart Web Scraper written in Python 3 to extract coupon details for a store location
☆14 · Mar 21, 2018 · Updated 8 years ago
aws-samples / redshift-streaming-ingestion-patterns
This is a collection of CDK projects to show how to load data from streaming services into Amazon Redshift.
☆13 · Sep 10, 2024 · Updated last year
scrapy-plugins / scrapy-splitvariants
Scrapy spider middleware to split an item into multiple items using a multi-valued key
☆21 · Feb 8, 2017 · Updated 9 years ago
scrapy / scrapy-bench
A CLI for benchmarking Scrapy.
☆32 · Jun 28, 2025 · Updated last year
Darklyter / StashVideohasherNode
☆16 · Apr 2, 2026 · Updated 3 months ago
roee88 / meta4kodi
MetaTeam Kodi stuff
☆18 · Dec 3, 2017 · Updated 8 years ago
guanfangdong / pytorch-frequency-regularization
☆17 · Nov 7, 2023 · Updated 2 years ago
hgiasac / hasura-oss-read-replicas-demo
Hasura OSS read replicas demo
☆10 · Dec 21, 2020 · Updated 5 years ago
big-o / cherrypicker
A Python package for extracting flat tables of data from complex structures.
☆12 · Jul 23, 2021 · Updated 5 years ago
adamjakab / Plex-NFO-Import-Agent
Plex import agent using .nfo descriptor files as source.
☆19 · Jan 19, 2023 · Updated 3 years ago