Tool to flatten a stream of JSON-like objects, configured via a schema
☆33 · Oct 19, 2019 · Updated 6 years ago
Alternatives and similar repositories for flatson
Users who are interested in flatson compare it with the libraries listed below
- Skinfer is a tool for inferring and merging JSON schemas · ☆141 · Apr 24, 2024 · Updated last year
- A Python implementation of DEPTA · ☆83 · Jan 14, 2017 · Updated 9 years ago
- Restrict crawl and scraping scope using matchers. · ☆26 · Jun 8, 2016 · Updated 9 years ago
- Detect and classify pagination links · ☆15 · Sep 9, 2020 · Updated 5 years ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe… · ☆55 · May 21, 2024 · Updated last year
- A project that attempts to log in to a website automatically, given a single seed · ☆11 · Jun 17, 2024 · Updated last year
- Exporters is an extensible export pipeline library that supports filters, transforms, and several sources and destinations · ☆40 · May 21, 2024 · Updated last year
- A Scrapy extension to log item coverage when the spider shuts down · ☆19 · Apr 11, 2020 · Updated 5 years ago
- NER toolkit for HTML data · ☆259 · May 3, 2024 · Updated last year
- Convert JavaScript code to an XML document · ☆187 · Mar 14, 2022 · Updated 3 years ago
- A linter for Scrapy projects. · ☆21 · Updated this week
- Simple Scrapy middleware to process non-well-formed HTML with BeautifulSoup · ☆21 · Sep 26, 2016 · Updated 9 years ago
- A decorator to write coroutine-like spider callbacks. · ☆109 · Dec 26, 2022 · Updated 3 years ago
- A Python library to detect and extract listing data from HTML pages. · ☆108 · May 5, 2017 · Updated 8 years ago
- Extract embedded metadata from HTML markup · ☆951 · Oct 1, 2025 · Updated 5 months ago
- Use pyppeteer from a Scrapy spider · ☆59 · Feb 5, 2020 · Updated 6 years ago
- A component that tries to avoid downloading duplicate content · ☆27 · Feb 10, 2026 · Updated 3 weeks ago
- Sentry component for Scrapy · ☆86 · Aug 21, 2023 · Updated 2 years ago
- A simple algorithm for clustering web pages, suitable for crawlers · ☆35 · Mar 6, 2017 · Updated 8 years ago
- A collection of GitHub workflow patterns · ☆10 · Feb 1, 2024 · Updated 2 years ago
- MongoDB extensions for Scrapy · ☆44 · Oct 2, 2014 · Updated 11 years ago
- Formasaurus tells you the type of an HTML form and its fields using machine learning · ☆119 · Feb 23, 2026 · Updated last week
- [Course Project, CS 251 (2018-1) - IIT Bombay] A secure personal cloud storage for files - Web Application (Django) · ☆10 · Mar 2, 2020 · Updated 6 years ago
- Generate and publish Grafana dashboards in Java. Build your own "blocks" and use auto-complete! · ☆11 · May 31, 2017 · Updated 8 years ago
- Semantic search web application with graph visualization in Django · ☆12 · Aug 2, 2017 · Updated 8 years ago
- A generic crawler · ☆78 · Feb 10, 2026 · Updated 3 weeks ago
- Vue.js + S3 => CMS · ☆12 · Nov 23, 2021 · Updated 4 years ago
- DeepAlign: Alignment-based Process Anomaly Correction Using Recurrent Neural Networks · ☆10 · Mar 25, 2023 · Updated 2 years ago
- Provide pagination for django-rest-framework using a "Link" HTTP header · ☆42 · Jul 24, 2024 · Updated last year
- Trending Places in OpenStreetMap! · ☆11 · Apr 28, 2017 · Updated 8 years ago
- Turn any Wikipedia article into a narrated, subtitled video, fully automated, CLI-first. · ☆20 · Jan 21, 2026 · Updated last month
- Debug your microservices application from IntelliJ IDEA · ☆14 · Feb 6, 2018 · Updated 8 years ago
- Alipay payment implemented with Python 3, Django, and Django REST framework. Includes Alipay payment, Alipay server asynchronous notifications, and Alipay refunds · ☆12 · May 26, 2018 · Updated 7 years ago
- Linux /proc data in a consistent, parsed format. · ☆10 · Mar 28, 2016 · Updated 9 years ago
- Automated operation and maintenance platform based on SaltStack. · ☆10 · Apr 19, 2020 · Updated 5 years ago
- OpenTracing Instrumentation for RxJava · ☆10 · Dec 11, 2020 · Updated 5 years ago
- Simple Python gevent web spider · ☆23 · Jun 27, 2011 · Updated 14 years ago
- HTTP Shell is a CLI tool based on the Kui framework that provides developers a modern alternative to HTTP clients for interacting with AP… · ☆12 · Dec 17, 2020 · Updated 5 years ago
- ☆13 · Jan 12, 2024 · Updated 2 years ago