JoshData / xml_diff
Compares two XML documents by diffing their text.
☆43Updated last year
Alternatives and similar repositories for xml_diff
Users that are interested in xml_diff are comparing it to the libraries listed below
- Transform flat data structures into nested object graphs matching JSON schema definitions.☆28Updated 9 years ago
- Python library with common functionality for writing web scrapers☆102Updated 10 years ago
- Python library and command line tool for converting data from one format to another☆99Updated 5 years ago
- Skinfer is a tool for inferring and merging JSON schemas☆141Updated last year
- Data Pipes for CSV☆115Updated 3 years ago
- Manage and load dataprotocols.org Data Packages☆27Updated 10 years ago
- Generate SQL tables, load and extract data, based on JSON Table Schema descriptors.☆62Updated 2 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- Utility library to turn country names into ISO two-letter codes☆71Updated 5 months ago
- Find which links on a web page are pagination links☆29Updated 9 years ago
- mltk - Moz Language Tool Kit☆12Updated 10 years ago
- An easy interface for documenting data packages☆19Updated 7 years ago
- Date parsing and normalization utilities for Python.☆22Updated 2 years ago
- Streaming newline delimited JSON I/O.☆12Updated 2 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 9 years ago
- A library for extracting tables from PDF files☆92Updated 5 years ago
- Data validation as a service. Project retired; go to the current one at frictionsless/repository☆69Updated 3 years ago
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.☆80Updated 2 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 4 years ago
- Tools for generating CSV and other flat versions of the structured data☆109Updated last month
- Inspect a URL and estimate if it contains a news story☆39Updated 2 weeks ago
- Framework for processing data packages in pipelines of modular components.☆123Updated 7 months ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 10 years ago
- An attempt at creating a gold standard dataset for backtesting yesterday & today's content-extractors☆35Updated 10 years ago
- View, visualize, clean and process data in the browser.☆146Updated 7 years ago
- Extract, parse and populate templates from strings☆27Updated 6 years ago
- Modularly extensible semantic metadata validator☆84Updated 10 years ago
- Scraper built with Scrapy.☆18Updated last year
- Definitions of Python jargon to help Python beginners understand Pythonista gobbledygook☆55Updated 5 years ago