the-dataface / Newspaper-Scrapers
Scrape article metadata from major media outlet's websites, including NYT, WaPo, WSJ. Built on top of the Newspaper Python Library (https://github.com/codelucas/newspaper).
☆38Updated 7 years ago
Related projects: ⓘ
- ☆25Updated this week
- Web scraper for indeed job search to reveal the data scientist required skills keywords☆36Updated 7 years ago
- Google News Scraper for languages like Japanese, Chinese... [VPN Support]☆94Updated 3 years ago
- A Python Package which helps to scrape all news details from any news websites☆179Updated last year
- Python script to extract as much structured information as possible from annual/quarterly reports.☆90Updated 8 months ago
- Python scripts to extract connection data and send connection requests on LinkedIn using Selenium WebDriver.☆26Updated 4 years ago
- Facebook Page and Group's Post Scraper is a script for gathering data using Facebook's Graph API☆47Updated 4 years ago
- Amber Heard Social Network Analysis of Disinformation/Influence Operations, Bots, & Crime Across-Platforms. - Twitter, Reddit, YouTube, I…☆52Updated last year
- A Google Trends Analytics Package☆13Updated 3 months ago
- The objective of this project is to scrape a corpus of news articles from a set of web pages, pre-process the corpus, and then to apply u…☆50Updated 6 years ago
- Using snscrape and tweepy libraries to scrape unlimited amount of tweets☆25Updated 3 years ago
- A GoodReads.com Scraper script to get books reviews including text and rating.☆39Updated 2 years ago
- A simple bot framework for commenting in subreddits.☆13Updated 7 years ago
- Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Tre…☆26Updated 2 years ago
- An open interface to GDELT APIs☆40Updated 9 months ago
- Drivers of freelancer success on Upwork.com☆17Updated 5 years ago
- A web crawler to crawl Best Global University Ranking on usnews website☆12Updated last year
- Scrape data from Goodreads using Scrapy and Selenium☆124Updated 3 months ago
- The Selenium scraper that collected a million stories from Medium.com☆75Updated 5 years ago
- Download and extract MDA section from edgar 10k forms☆76Updated this week
- A news aggregator in python, that focuses primarily on business and market news sources.☆100Updated last year
- An automated, programming-free web scraper for interactive sites☆105Updated last year
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.☆114Updated 4 years ago
- Application to get a quick lookup of the past financial performance of publicly-traded US companies.☆7Updated last year
- Source real estate prices from the Common Crawl.☆27Updated 5 years ago
- Twitter Trends is a web-based application that automatically detects and analyzes emerging topics in real time through hashtags and user …☆99Updated 7 years ago
- Scrapes sites. Gets news. Eventually events.☆80Updated 8 years ago
- This repository provides usage examples for the Python module Newspaper3k.☆138Updated 8 months ago
- A dashboard is worth a thousand words => https://datastudio.google.com/reporting/755f3183-dd44-4073-804e-9f7d3d993315☆27Updated 2 years ago
- A Python script to retrieve plain text transcripts from YouTube videos☆27Updated 7 years ago