alexksikes / mass-scraping
Quickly download and scrape websites on a massive scale.
☆64 Updated 12 years ago
Alternatives and similar repositories for mass-scraping:
Users who are interested in mass-scraping are comparing it to the libraries listed below.
- Scrapes sites. Gets news. Eventually events. ☆85 Updated 9 years ago
- A library to interface with the Linkscape API. ☆40 Updated 6 years ago
- Some Python scripts I use for auditing, research and lead generation. ☆30 Updated 8 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆56 Updated last year
- Crawler and scraper of the public directory of companies on LinkedIn. ☆25 Updated 6 years ago
- Summary is a complete solution to extract the title, image and description from any URL. ☆18 Updated last year
- SEO tool to track keyword rankings on search engines (Google App Engine application) ☆50 Updated 12 years ago
- A Python tool to extract data types such as email addresses, URLs, domains and phone numbers. ☆38 Updated 11 years ago
- Get data about companies from advanced search without using an API ☆62 Updated 5 years ago
- Scraping Amazon reviews using headless Chrome and Selenium ☆10 Updated 6 years ago
- Scraping the Walmart, Target and Home Depot websites and getting data from the Amazon API ☆16 Updated 8 years ago
- Framework for scraping legislative/government data ☆85 Updated 7 months ago
- 👨👩👦 Social account detection and extraction in Python, e.g. for crawling/scraping. ☆46 Updated 2 years ago
- Social media monitoring tools such as sentiment analysis, keyword tracking and more ☆48 Updated 11 years ago
- Various Python scripts to scrape sites that store data about you. ☆28 Updated 11 years ago
- ScraperWiki Python library for scraping and saving data ☆159 Updated 2 years ago
- Scrape every LinkedIn public profile using Scrapy (Python) ☆15 Updated 10 years ago
- Collection of Python scripts I have created to crawl various websites, mostly for lead generation projects to match keywords and collect … ☆131 Updated last year
- Scrapy middleware that allows crawling only new content ☆80 Updated 2 years ago
- A browser extension that lets you find email addresses for any domain with a single click. ☆71 Updated 7 years ago
- A Python script that interfaces Facebook chat with a chatbot using the XMPP protocol. ☆29 Updated 12 years ago
- API - extract a list of keywords from a text. ☆18 Updated 7 years ago
- Extract social media links and account names from websites. ☆38 Updated 4 years ago
- ☆35 Updated last year
- Collection of scripts for The TWINT project ☆54 Updated 5 years ago
- Scrapes upwork.com using BeautifulSoup and Selenium ☆12 Updated 7 years ago
- Python script that periodically probes the Craigslist RSS feeds for new listings. ☆39 Updated 13 years ago
- Automates the process of repeatedly searching for a website via scraped proxy IP and search keywords ☆44 Updated last year
- Scrape Google search results with Scrapy. ☆98 Updated 5 years ago
- A library to parse the Wayback Machine of archive.org to get historical views of web pages. It is a useful tool to research on the evolutio… ☆20 Updated 6 years ago