zytedata / flattering
Flatten, format, and export any JSON-like data to CSV (or any other string output).
☆17 · Updated 3 years ago
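
As a rough illustration of what "flatten and export to CSV" means in practice, here is a minimal sketch using only the Python standard library. It does not use or reproduce flattering's own API. The function names `flatten` and `to_csv`, the dotted/indexed column naming, and the sample items are all assumptions made for this example.

```python
import csv
import io


def flatten(value, prefix=""):
    """Flatten nested dicts and lists into one dict keyed by path strings.

    Hypothetical naming scheme: "a.b" for dict keys, "a[0]" for list items.
    """
    flat = {}
    if isinstance(value, dict):
        for key, child in value.items():
            path = f"{prefix}.{key}" if prefix else str(key)
            flat.update(flatten(child, path))
    elif isinstance(value, list):
        for index, child in enumerate(value):
            flat.update(flatten(child, f"{prefix}[{index}]"))
    else:
        # Scalar leaf: store it under the path built so far.
        flat[prefix] = value
    return flat


def to_csv(items):
    """Render a list of JSON-like items as CSV text."""
    rows = [flatten(item) for item in items]
    # Columns appear in first-seen order across all rows.
    columns = list(dict.fromkeys(key for row in rows for key in row))
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=columns, restval="", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample data for illustration only.
items = [
    {"name": "A", "price": {"value": 10, "currency": "USD"}, "tags": ["x", "y"]},
    {"name": "B", "price": {"value": 5}},
]
csv_text = to_csv(items)
```

Items that lack a column are padded with empty cells, so rows of uneven shape still line up under one header.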
Related projects
Alternatives and complementary repositories for flattering
- Paginating the web ☆37 · Updated 10 years ago
- Detect and classify pagination links ☆14 · Updated 4 years ago
- Scrapy spider middleware to clean up query parameters in request URLs ☆25 · Updated 8 years ago
- Library for annotation-based dependency injection ☆22 · Updated last month
- A scrapy extension to store requests and responses information in storage service ☆26 · Updated 2 years ago
- Ultra-simple human readable DSL for matching text. ☆8 · Updated 7 years ago
- A graph query engine ☆10 · Updated 6 months ago
- Web scraping Page Objects core library ☆95 · Updated last month
- Happy Eyeballs connection algorithm and underlying scheduling logic in asyncio ☆12 · Updated 2 months ago
- Generative tree visualiser for Python ☆14 · Updated 4 years ago
- A python implementation of DEPTA ☆83 · Updated 7 years ago
- Python context manager to communicate with a subprocess using iterables: for when data is too big to fit in memory and has to be streamed ☆9 · Updated last month
- A library for sending software performance metrics from Python libraries and apps to statsd. ☆30 · Updated 5 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆65 · Updated 2 years ago
- Scrapy downloader middleware that stores response HTMLs to disk. ☆18 · Updated 6 months ago
- Custom Python functions for working with SQLite FTS4 ☆22 · Updated 2 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 · Updated 2 years ago
- Common interface for data container classes ☆62 · Updated this week
- Restrict crawl and scraping scope using matchers. ☆25 · Updated 8 years ago
- Python 3 AsyncIO powered scraping framework with batteries included ☆20 · Updated 8 years ago
- Object-relational in-memory database layer based on LMDB ☆29 · Updated last year
- A component that tries to avoid downloading duplicate content ☆27 · Updated 6 years ago
- A state machine service. ☆15 · Updated last year
- Pluggable DSL that uses pipes to perform a series of linear transformations to extract data ☆15 · Updated 4 months ago
- Python clients for Zyte AutoExtract API ☆39 · Updated 2 years ago
- Library to populate items using XPath and CSS with a convenient API ☆45 · Updated last month
- Scrapy schema validation pipeline and Item builder using JSON Schema ☆44 · Updated 3 years ago
- ☆12 · Updated 7 years ago
- Python's missing statistical Swiss Army knife ☆15 · Updated 9 years ago
- Detect and classify pagination links ☆99 · Updated 4 years ago