This project aims to use the Hadoop framework to analyze unstructured data that we obtain from Twitter and perform sentiment and trend analysis using Hive on MapReduce and Spark on keyword “COVID19”. We then compare the Hive and Spark approaches to determine the best performance.
☆17May 8, 2020Updated 5 years ago
Alternatives and similar repositories for Twitter-Data-Analysis-on-COVID19-using-Hadoop-Flume-Hive-and-Spark.
Users that are interested in Twitter-Data-Analysis-on-COVID19-using-Hadoop-Flume-Hive-and-Spark. are comparing it to the libraries listed below
Sorting:
- ☆10May 25, 2021Updated 4 years ago
- Architecture of Twint scrapper which allow download tweets on many instances without api restrictions☆10Nov 30, 2020Updated 5 years ago
- A Python Reddit scraper with dual-mode architecture: simple requests for small jobs, async + proxy rotation for large-scale scraping. Fea…☆16Oct 30, 2025Updated 4 months ago
- The program can be used to scrape the content from an article from web by an input of a set of URLs in a text file or a URL. This project…☆17Aug 5, 2020Updated 5 years ago
- Examples for using the Pipl SEARCH API☆11Dec 19, 2023Updated 2 years ago
- A Python package for accessing the OpenCorporates API☆11Feb 12, 2019Updated 7 years ago
- Scrape most mentioned stock tickers from Reddit. Wallstreetbets and Wallstreetbetsnew☆12Mar 5, 2021Updated 5 years ago
- Twitter based sentiment analysis using JAVA and Hadoop. In this project we are doing the sentiment analysis on twitter data to analyse wh…☆10Apr 22, 2018Updated 7 years ago
- Web app which displays the daily and hourly sentiments for a stock (user to enter ticker as input). Stock sentiments are determined from…☆10Sep 26, 2022Updated 3 years ago
- Keep up with all tech trends from single page!☆10Jul 30, 2022Updated 3 years ago
- ☆12Jul 6, 2021Updated 4 years ago
- Elementor widgets development tutorial☆13Dec 6, 2022Updated 3 years ago
- Tools and dumps related to the Smishing Triad and the USPS smishing campaign from late 2023 into 2024☆11Apr 28, 2024Updated last year
- Search Google from CLI☆10Nov 5, 2022Updated 3 years ago
- Advanced Crawling Add-on for WP2Static☆11May 10, 2021Updated 4 years ago
- A python SDK for accessing the Keymate-API☆13Jun 25, 2024Updated last year
- Solidity smart contract for atomic swaps.☆10Oct 31, 2022Updated 3 years ago
- Build wordlists from the common-crawl index☆12Oct 9, 2022Updated 3 years ago
- ☆10Nov 12, 2022Updated 3 years ago
- ☆13Aug 27, 2020Updated 5 years ago
- A Feature rich Gatsby theme plugin for creating blogs from headless WordPress CMS.☆10Feb 4, 2026Updated last month
- We scrape news headlines for FB and TSLA then apply sentiment analysis to generate investment insight.☆12Nov 30, 2020Updated 5 years ago
- Newsdata.io Official Python Client☆14Jan 14, 2026Updated last month
- Air Traffic Control Simulator☆17Mar 13, 2022Updated 3 years ago
- Open Database Hunting - Finding potential breaches.☆10Feb 2, 2022Updated 4 years ago
- This repository contains my implementation of the algorithms described in the book "Data Science From Scratch" by Joel Grus. Please scrol…☆11Jan 28, 2020Updated 6 years ago
- Code from Bellingcat's guide☆11Dec 8, 2022Updated 3 years ago
- ☆10Apr 2, 2022Updated 3 years ago
- A Python library for creating adversarial splits☆14Jul 24, 2022Updated 3 years ago
- 🕷️ n8n Community Node for Scrappey API – Automate web scraping and data extraction with advanced anti-bot blocking technology, seamlessl…☆16Feb 2, 2026Updated last month
- Global Terrorism Database Interactive Dashboard☆10Dec 8, 2022Updated 3 years ago
- Visual search interface☆11Nov 30, 2021Updated 4 years ago
- WPScan is a black box WordPress vulnerability scanner.☆10Oct 6, 2017Updated 8 years ago
- Detecting fake news articles by analyzing patterns in writing.☆10Mar 30, 2020Updated 5 years ago
- RSS Feed integration for the Notion using AWS Lambda☆11Jun 2, 2021Updated 4 years ago
- Exploits Wikipedia's daily view counts to find out what topics are current trends☆18May 7, 2013Updated 12 years ago
- NodeJS backend for SentiSocial☆10Oct 27, 2018Updated 7 years ago
- An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark.☆10Dec 11, 2024Updated last year
- A node for n8n that integrates the Tavily API, enabling powerful web search and content extraction within your no-code automation workflo…☆19Feb 26, 2026Updated last week