knadh / xmlutils.py
Python scripts for processing XML documents and converting to SQL, CSV, and JSON [UNMAINTAINED]
☆243 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for xmlutils.py
- Python library and command line tool for converting data from one format to another ☆100 · Updated 4 years ago
- ScraperWiki Python library for scraping and saving data ☆159 · Updated last year
- Analysis and visualization of email data ☆142 · Updated 7 years ago
- Python scripts for scraping bus ticket data from the websites of BoltBus, Greyhound, Megabus, GoBus, Amtrak, Peterpan, and EasternTravel. ☆39 · Updated 4 years ago
- Scraping Tweet data for Russian Troll Twitter accounts into Neo4j ☆57 · Updated 6 years ago
- A library for extracting tables from PDF files ☆90 · Updated 11 years ago
- Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that. ☆125 · Updated 11 years ago
- Converts JSON files to CSV (pulling data from nested structures). Useful for Mongo data ☆264 · Updated 3 years ago
- Chunks of Python I've found useful. ☆63 · Updated 4 years ago
- json to xml converter in python3 ☆99 · Updated 3 weeks ago
- Python client library for controlling Google Refine ☆39 · Updated 11 years ago
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart… ☆100 · Updated 5 years ago
- ☆13 · Updated 9 years ago
- Scrapes sites. Gets news. Eventually events. ☆82 · Updated 8 years ago
- xmldataset: xml parsing made easy 🗃️ ☆77 · Updated 4 years ago
- 🔎 Finds fuzzy matches between CSV files ☆184 · Updated 7 months ago
- Python classes for streaming graph to gephi ☆81 · Updated 8 years ago
- Extract tables from PDF pages. ☆277 · Updated 4 years ago
- Scrapy middleware which allows to crawl only new content ☆79 · Updated 2 years ago
- Python library with common functionality for writing web scrapers ☆102 · Updated 9 years ago
- Python module to drive the awesome pdftk binary. ☆147 · Updated last year
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis. ☆78 · Updated last year
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city. ☆62 · Updated 7 years ago
- Switch to the sigmaexporter-plugin branch (https://github.com/oxfordinternetinstitute/gephi-plugins/tree/sigmaexporter-plugin/modules/sig… ☆63 · Updated 4 years ago
- Scrape a public LinkedIn profile. ☆153 · Updated 4 months ago
- No longer maintained! See https://bitbucket.org/vangheem/pyzipcode ☆20 · Updated 5 years ago
- A client for Feedly ☆147 · Updated 6 years ago
- Street address parser and formatter ☆92 · Updated 5 years ago
- Nested JSON to CSV Converter ☆287 · Updated 2 years ago