sahilbhange / Facebook-Data-Extraction
#DataPipeLine #ETL - Created is a Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Graph API and Python to extract the data and loaded the data into the CSV files for further analysis.
☆14Updated 6 years ago
Alternatives and similar repositories for Facebook-Data-Extraction:
Users that are interested in Facebook-Data-Extraction are comparing it to the libraries listed below
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- Learn how to auto-ingest streaming data into Snowflake using Snowpipe.☆23Updated 2 years ago
- Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project☆8Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Spark implementation of Slowly Changing Dimension type 2☆11Updated 6 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆29Updated 11 months ago
- ☆25Updated 4 years ago
- Snowflake Cookbook, published by Packt☆78Updated 2 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆15Updated 6 years ago
- Hortonworks Data Platform Retail Analytics Demo☆13Updated 8 years ago
- This data dictionary provides information about the tables and views in the "workgroup" PostgreSQL database of the Tableau Server reposit…☆41Updated 10 months ago
- A curated list of awesome Databricks resources, including Spark☆17Updated 8 months ago
- Big Data Demystified meetup and blog examples☆31Updated 7 months ago
- A curated list of awesome Snowflake analytic data warehouse learning resources☆19Updated 4 years ago
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 4 years ago
- dbt / Amazon Redshift Demonstration Project☆34Updated 2 years ago
- Project based learning for Data Engineering fundamentals.☆13Updated 4 years ago
- Database plugins☆14Updated this week
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆21Updated 3 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆25Updated 5 years ago
- data engineering 100 days 🤖 🧲 🦾 | #DE☆40Updated last year
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆26Updated 2 years ago
- Course Material☆24Updated 2 years ago
- 3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow☆12Updated 5 years ago
- ☆20Updated 5 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated last year
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 6 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Updated this week
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc☆48Updated last year