san089 / Big_Data_Project
Fake News Detection: feature extraction using vectorizers such as Count Vectorizer, TF-IDF Vectorizer, and Hash Vectorizer, followed by an ensemble model that classifies whether a news item is fake.
☆19 Updated 5 years ago
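The description above names three vectorization schemes and an ensemble classifier. The following is a minimal pure-Python sketch of those ideas, not the repository's actual code: the function names, the whitespace tokenizer, the smoothing-free IDF formula, the CRC32 hash, and the majority-vote ensemble are all assumptions chosen for illustration.

```python
import math
import zlib
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; real pipelines do more.
    return text.lower().split()


def count_vectors(docs):
    # Count vectorization: one column per vocabulary term, raw term counts.
    vocab = sorted({tok for doc in docs for tok in tokenize(doc)})
    index = {tok: i for i, tok in enumerate(vocab)}
    vectors = []
    for doc in docs:
        row = [0] * len(vocab)
        for tok, n in Counter(tokenize(doc)).items():
            row[index[tok]] = n
        vectors.append(row)
    return vocab, vectors


def tfidf_vectors(docs):
    # TF-IDF: scale each count by idf = ln(N / df) + 1 (an illustrative variant).
    vocab, counts = count_vectors(docs)
    n_docs = len(docs)
    df = [sum(1 for row in counts if row[j] > 0) for j in range(len(vocab))]
    idf = [math.log(n_docs / d) + 1.0 for d in df]
    return vocab, [[c * w for c, w in zip(row, idf)] for row in counts]


def hashed_vectors(docs, n_buckets=8):
    # Hashing trick: no vocabulary is stored; each token maps to a fixed bucket.
    vectors = []
    for doc in docs:
        row = [0] * n_buckets
        for tok in tokenize(doc):
            row[zlib.crc32(tok.encode("utf-8")) % n_buckets] += 1
        vectors.append(row)
    return vectors


def majority_vote(predictions):
    # Hypothetical ensemble step: return the label most base models agree on.
    return Counter(predictions).most_common(1)[0][0]


docs = ["fake news spreads fast", "real news spreads"]
vocab, counts = count_vectors(docs)
_, tfidf = tfidf_vectors(docs)
hashed = hashed_vectors(docs)
label = majority_vote(["fake", "real", "fake"])
```

In this sketch, a term that appears in every document (such as "news") keeps a weight of 1.0 under TF-IDF, while rarer terms are weighted up. The hashed vectors have a fixed width regardless of vocabulary size, at the cost of possible bucket collisions.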
Alternatives and similar repositories for Big_Data_Project:
Users who are interested in Big_Data_Project are comparing it to the repositories listed below:
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow ☆144 Updated 4 years ago
- Udacity Data Engineering Nanodegree Capstone Project ☆36 Updated 5 years ago
- Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collab… ☆37 Updated 5 years ago
- Git Repository ☆140 Updated 3 months ago
- Stream processing with Azure Databricks ☆138 Updated 5 months ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,… ☆56 Updated 2 years ago
- Udacity Data Engineering Nano Degree (DEND) ☆185 Updated 5 years ago
- ☆87 Updated 2 years ago
- Price Crawler - Tracking Price Inflation ☆186 Updated 4 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com ☆160 Updated 2 years ago
- Classwork projects and homework completed for the Udacity Data Engineering Nanodegree ☆74 Updated last year
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR ☆83 Updated 5 years ago
- Ravi Azure ADB ADF Repository ☆66 Updated 3 months ago
- Projects done in the Data Engineering Nanodegree by Udacity.com ☆272 Updated 5 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS ☆45 Updated 5 years ago
- This repo contains commands that data engineers use in day to day work. ☆60 Updated 2 years ago
- I have tried to solve some complex SQL interview questions that have been asked at several companies. Collected these questions from Ankit Ban… ☆98 Updated 2 years ago
- apache-spark-with-databricks-for-data-engineering ☆84 Updated 10 months ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco. ☆140 Updated 4 years ago
- PySpark Projects ☆23 Updated this week
- Data Engineer with Python lecture notes from #datacamp. ☆46 Updated 3 years ago
- YouTube tutorial project ☆101 Updated last year
- Azure Data Factory ☆62 Updated last month
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3 ☆30 Updated 4 years ago
- This is a template you can use for your next data engineering portfolio project. ☆176 Updated 3 years ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here. ☆104 Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews ☆127 Updated 11 months ago
- ☆151 Updated 2 years ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution ☆67 Updated 4 years ago
- This is the first project where we worked with Apache Spark. In this project, we downloaded the datasets from KAGG… ☆18 Updated 3 years ago