gambolputty / newscorpus
A Python scraping module that extracts text from articles found in RSS feeds. Uses SQLite as its database.
☆20 Updated last year
Alternatives and similar repositories for newscorpus
Users who are interested in newscorpus are comparing it to the libraries listed below
- ETL pipeline, graphical explorer, and general toolbox for investigations with Follow the Money data ☆24 Updated 5 months ago
- A helper library full of URL-related heuristics. ☆73 Updated 3 months ago
- Extract networks of entities from journalistic reporting