rocheio / wiki-table-scrape
Scrape tables from Wikipedia articles into CSVs
☆75Updated 5 years ago
Alternatives and similar repositories for wiki-table-scrape
Users that are interested in wiki-table-scrape are comparing it to the libraries listed below
- ⛏ a library for scraping unreliable pages☆212Updated last month
- Extract countries, regions and cities from a URL or text☆217Updated 5 years ago
- track changes to the news, where news is anything with an RSS feed☆182Updated 5 years ago
- Python scripts for creating stylistic word clouds☆87Updated 9 years ago
- 🔎 Finds fuzzy matches between CSV files☆191Updated 10 months ago
- Tools for generating CSV and other flat versions of the structured data☆109Updated last month
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 9 years ago
- A Python module to discover the etymology of words☆152Updated last year
- A Twitter search client mining tweets using their advanced search implementation.☆90Updated 7 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆393Updated 2 years ago
- Import tables from any Wikipedia article as a dataset in Python☆293Updated 4 years ago
- Python script to load CSV to SQLite☆249Updated 2 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.☆66Updated 3 years ago
- Detect and visualize text reuse☆119Updated last year
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆157Updated 4 months ago
- Python 3.x notebooks about real-world data cleaning and visualization☆72Updated 9 years ago
- Real-time sentiment analysis in Python using Twitter's streaming API☆255Updated 7 years ago
- Python package for data.world☆101Updated last year
- Python library for reading and writing tabular data via streams.☆238Updated 4 years ago
- Predict age and gender from a first name☆59Updated 7 years ago
- Automatically extracts and normalizes an online article or blog post publication date☆118Updated 2 years ago
- Get gender from first name in Python☆165Updated 7 years ago
- Parse Popolo JSON data and navigate it with Python☆15Updated 6 years ago
- Population figures for countries, regions (e.g. Asia) and the world.☆105Updated 10 months ago
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis☆592Updated 2 years ago
- How Quartz used AI to help reporters search the Mauritius Leaks☆48Updated 6 years ago
- Visualise Wikipedia page edits using History Flow☆48Updated 9 years ago
- A wrapper around tweepy to produce pandas dataframes for analysis☆75Updated 9 years ago
- A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies.☆293Updated 2 years ago
- Python script for matching a list of messy addresses against a gazetteer using dedupe.☆64Updated 5 years ago