the-dataface / Newspaper-Scrapers
Scrape article metadata from major media outlet's websites, including NYT, WaPo, WSJ. Built on top of the Newspaper Python Library (https://github.com/codelucas/newspaper).
☆45Updated 7 years ago
Alternatives and similar repositories for Newspaper-Scrapers:
Users that are interested in Newspaper-Scrapers are comparing it to the libraries listed below
- Downloads news articles from Google news and uses pre-trained NLP models to perform sentiment analysis☆55Updated 2 years ago
- Jupyter notebooks for Data Science for Journalism☆15Updated 4 years ago
- Scrapers from a project in 2018. Yelp, Spyfu, Similarweb, Morningstar, Linkedin, Instagram, Inside, Glassdoor, Facebook, Eat24, Doordash,…☆91Updated 5 years ago
- A news aggregator in python, that focuses primarily on business and market news sources.☆110Updated this week
- A set of Spiders to gather product's data from Etsy Website.☆36Updated 4 years ago
- A Python Package which helps to scrape all news details from any news websites☆189Updated 2 months ago
- Finco automates the process of generating financial documentation and valuations for companies traded on the NASDAQ and NYSE. Provides us…☆25Updated 9 years ago
- Simple Python utility that downloads and extracts SEC financial statement data sets.☆32Updated 7 years ago
- This repository includes our work on extracting the digital transformation strategy of Fortune 500 companies from earnings calls transcri…☆27Updated 4 years ago
- A classifier that distinguishes political from non-political news articles.☆28Updated last year
- Application to get a quick lookup of the past financial performance of publicly-traded US companies.☆7Updated 2 years ago
- This program extracts insider trading data from the sec website and stores it in excel file for the specified time frame.☆51Updated 2 years ago
- Yellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.☆72Updated 4 years ago
- Sample notebooks for using the Global Database of Events, Language and Tone (GDELT).☆16Updated 4 years ago
- Python script to extract as much structured information as possible from annual/quarterly reports.☆95Updated last year
- Bot for scraping job profiles and applying to jobs on Indeed.☆13Updated 5 years ago
- Using Python and Gephi to map and visualize personal twitter networks☆20Updated 4 years ago
- Facebook Page and Group's Post Scraper is a script for gathering data using Facebook's Graph API☆48Updated 4 years ago
- Google News Scraper for languages like Japanese, Chinese... [VPN Support]☆96Updated 3 years ago
- Newsfeed based on GDELT Project☆22Updated 8 months ago
- Scrape and parse Google search results in Python☆32Updated last year
- Dropshipping Bot that leverages Walmart's API to search for overpriced merchandise (Jewelry is the example) and uploads to Ebay to resell…☆28Updated 5 years ago
- A Google Trends Analytics Package☆13Updated 7 months ago
- Using snscrape and tweepy libraries to scrape unlimited amount of tweets☆26Updated 3 years ago
- This page has instructions and a python script to help you break GDELT Data away from the Google Big Query Environment☆11Updated 3 years ago
- Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Tre…☆27Updated 2 years ago
- Arbitrage Engine for Amazon, eBay, and Facebook Marketplace☆28Updated 6 years ago
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆35Updated last year
- A simple machine learning package to cluster keywords in higher-level groups.☆16Updated 2 years ago