MongoDB extensions for Scrapy
☆44Oct 2, 2014Updated 11 years ago
Alternatives and similar repositories for scmongo
Users that are interested in scmongo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extensions for using Scrapy on Amazon AWS☆32Dec 5, 2012Updated 13 years ago
- A python library detect and extract listing data from HTML page.☆109May 5, 2017Updated 8 years ago
- ☆223Apr 27, 2015Updated 10 years ago
- Listaa raideja ja silleen☆16Nov 2, 2022Updated 3 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆39May 21, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆33Oct 20, 2025Updated 5 months ago
- Restrict crawl and scraping scope using matchers.☆26Jun 8, 2016Updated 9 years ago
- Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)☆33Feb 22, 2018Updated 8 years ago
- A scrapy pipeline which send items to Elastic Search server☆98Jan 2, 2018Updated 8 years ago
- High Level Kafka Scanner☆19Sep 29, 2017Updated 8 years ago
- Tool to flatten stream of JSON-like objects, configured via schema☆33Oct 19, 2019Updated 6 years ago
- WordPress plugin for guided product tours powered by Intro.js☆16Feb 9, 2014Updated 12 years ago
- Convert Javascript code to an XML document☆188Mar 14, 2022Updated 4 years ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key☆21Feb 8, 2017Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Library designed to replace the SQLite backend by a MongoDB backend on Scrapy queue management☆17Sep 2, 2017Updated 8 years ago
- Python library of web-related functions☆418Mar 20, 2026Updated 3 weeks ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55May 21, 2024Updated last year
- Keywords enrichment by autocompletion (AWS, PM, RDC, CDS, ...), google suggestion scraping Heavy multithreaded semantic corpus crawler S…☆12May 22, 2015Updated 10 years ago
- Paginating the web☆37Feb 11, 2014Updated 12 years ago
- Google reverse image search scraper in PHP☆21Jan 21, 2013Updated 13 years ago
- A pure-python HTML screen-scraping library☆1,887Apr 4, 2022Updated 4 years ago
- A backend for StatsD to emit stats to mongodb.☆40Aug 7, 2018Updated 7 years ago
- HTTP API for Scrapy spiders☆879Mar 20, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Scrapy project to scrape public web directories (educational)☆22Mar 18, 2017Updated 9 years ago
- ElasticSearch Backbone library for quickly building Faceted Search front ends.☆103Jul 24, 2014Updated 11 years ago
- Scrapinghub Command Line Client☆130Updated this week
- Small set of utilities to simplify writing Scrapy spiders.☆50Jul 24, 2015Updated 10 years ago
- iOS universal links 服务器☆11Jun 23, 2017Updated 8 years ago
- Google Universial Analytics Measurement Protocol Implementation for Java☆11May 29, 2013Updated 12 years ago
- Scrapy spider middleware to clean up query parameters in request URLs☆24Jun 30, 2016Updated 9 years ago
- Convert GPS degrees, minutes, seconds coordinates to decimal value. Useful for parsing PGS exif tags in geotagged images, Google Maps, an…☆18Oct 31, 2016Updated 9 years ago
- Leaflet plugin for precise feature selection☆19Jan 4, 2014Updated 12 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Simple Python3 Supervisor library☆14Apr 6, 2026Updated last week
- Pyppeteer integration for Scrapy☆58Feb 26, 2021Updated 5 years ago
- Minimizes Django templates so that html is served up already minimized. Minimizes django templates and the html, in-line javascript, and…☆27Dec 6, 2015Updated 10 years ago
- Blog Helper is a Alexa skill that provides a voice interface for WordPress.com blogs☆13Mar 3, 2025Updated last year
- NER toolkit for HTML data☆259May 3, 2024Updated last year
- A scalable frontier for web crawlers☆1,329Jun 6, 2025Updated 10 months ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado☆160Apr 7, 2026Updated last week