the-dataface / Newspaper-ScrapersLinks
Scrape article metadata from major media outlet's websites, including NYT, WaPo, WSJ. Built on top of the Newspaper Python Library (https://github.com/codelucas/newspaper).
☆53Updated 7 years ago
Alternatives and similar repositories for Newspaper-Scrapers
Users that are interested in Newspaper-Scrapers are comparing it to the libraries listed below
Sorting:
- A Python Package which helps to scrape all news details from any news websites☆208Updated last week
- The objective of this project is to scrape a corpus of news articles from a set of web pages, pre-process the corpus, and then to apply u…☆50Updated 7 years ago
- Google News Scraper for languages like Japanese, Chinese... [VPN Support]☆98Updated 4 years ago
- ☆65Updated 4 years ago
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.☆119Updated 5 years ago
- Scrapers from a project in 2018. Yelp, Spyfu, Similarweb, Morningstar, Linkedin, Instagram, Inside, Glassdoor, Facebook, Eat24, Doordash,…☆97Updated 6 years ago
- Downloads news articles from Google news and uses pre-trained NLP models to perform sentiment analysis☆58Updated 3 years ago
- A financial disclosure data extraction tool.☆16Updated last year
- Promise Tracker is a tool designed to help journalists and civil society watchdogs track campaign/promises/pledges by government official…☆15Updated 8 months ago
- 🤩 Python Package for Scraping Amazon Product Reviews ✨☆37Updated 2 years ago
- Jupyter notebooks for Data Science for Journalism☆15Updated 5 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- A classifier that distinguishes political from non-political news articles.☆30Updated last year
- Web scraping Reddit without using Reddit API, and making a dataset, and using the dataset for a machine learning project.☆80Updated 2 years ago
- Securities and Exchange Commission utility package for dealing with Edgar database. Includes methods to download index files and SEC file…☆36Updated 4 years ago
- A news crawler for BBC News, Reuters and New York Times.☆117Updated 2 years ago
- For all kinds of textual analysis: literary, social media, surveys...☆32Updated 3 years ago
- Scrape data from Quora website: questions related to certain topics, answers given on certain questions and users profile data☆54Updated 2 years ago
- Social Network Analysis of Disinformation, Platforms, Freelancing around Amber Heard Johnny Depp Elon Musk- Twitter, Reddit, YouTube, Ins…☆57Updated 2 years ago
- A dashboard is worth a thousand words => https://datastudio.google.com/reporting/755f3183-dd44-4073-804e-9f7d3d993315☆28Updated 3 years ago
- A repository demonstrating the use of real-estate-scrape to store the estimated value of a property on Redfin and Zillow every night usin…☆34Updated this week
- A Python Client for collect and parse public data from the Youtube Data API☆81Updated last year
- I'm a curious person and analysing world news is fun. Here I'm gathering all my Gdelt-related projects.☆22Updated 4 years ago
- This is one of the first and main programs i made for 99☆22Updated 2 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 8 months ago
- A Selenium based automated program that scrapes profiles data,stores in CSV,follows them and saves their profile in PDF.☆33Updated last year
- Scrape data from Goodreads using Scrapy and Selenium☆139Updated last year
- Scrape and parse Google search results in Python☆31Updated 2 years ago
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆38Updated this week
- Python package to scrape product review data from amazon☆31Updated 3 years ago