udger / udger-python
Python user agent string parser based on Udger: https://udger.com/products/local_parser
☆40 Updated 2 years ago
Alternatives and similar repositories for udger-python
Users who are interested in udger-python are comparing it to the libraries listed below:
- Find the path of a key / value in a JSON hierarchy easily. ☆97 Updated 9 months ago
- URL Transformation, Sanitization ☆103 Updated 2 years ago
- A generic crawler ☆78 Updated last week
- A component that tries to avoid downloading duplicate content ☆27 Updated last week
- A project to attempt to automatically login to a website given a single seed ☆127 Updated last week
- Modern robots.txt Parser for Python ☆197 Updated 2 years ago
- Restrict crawl and scraping scope using matchers. ☆26 Updated 9 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations ☆40 Updated last year
- URL normalization for Python ☆99 Updated 9 months ago
- python elasticsearch client ☆362 Updated 3 years ago
- Bringing sanity to world of messed-up data ☆66 Updated 11 years ago
- PyQuery-based scraping micro-framework. ☆118 Updated 4 years ago
- An Authenticated Encryption with Associated Data (AEAD) implementation for Python. ☆37 Updated 8 years ago
- MaxMind GeoLite2 database as a convenient Python package ☆63 Updated 7 years ago
- Crochet: use Twisted anywhere! ☆240 Updated last year
- Pyzmail is a high level mail library for Python, providing functions to read, compose and send emails ☆58 Updated 7 years ago
- Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python ☆291 Updated this week
- Scrapy middleware which allows to crawl only new content ☆79 Updated last week
- A Python module to fetch and parse results from different search engines. ☆79 Updated 7 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schema ☆45 Updated 4 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 Updated 8 years ago
- Tools that will make writing tests, bots and scrapers using Selenium much easier ☆139 Updated last year
- Scrapinghub Command Line Client ☆131 Updated 2 months ago
- A parser for the free proxy list on HideMyAss! ☆58 Updated 8 years ago
- Python task queue ☆49 Updated 7 years ago
- Tool to flatten stream of JSON-like objects, configured via schema ☆33 Updated 6 years ago
- Extracts the top level domain (TLD) from the URL given. ☆185 Updated 8 months ago
- Crochet-based blocking API for Scrapy. ☆46 Updated 8 years ago
- Simple Human wrapper for cURL library ☆201 Updated 4 years ago
- Python client for Elasticsearch Watcher (deprecated) ☆23 Updated 7 years ago