sahilbhange / Facebook-Data-Extraction
#DataPipeLine #ETL - Created is a Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Graph API and Python to extract the data and loaded the data into the CSV files for further analysis.
☆14Updated 6 years ago
Alternatives and similar repositories for Facebook-Data-Extraction:
Users that are interested in Facebook-Data-Extraction are comparing it to the libraries listed below
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- A curated list of awesome Databricks resources, including Spark☆17Updated 9 months ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆54Updated last year
- Spark implementation of Slowly Changing Dimension type 2☆11Updated 6 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆27Updated 2 years ago
- Learn how to auto-ingest streaming data into Snowflake using Snowpipe.☆23Updated 2 years ago
- Learning Google BigQuery, published by Packt☆14Updated 2 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- Source code for 'BigQuery for Data Warehousing' by Mark Mucchetti☆16Updated 4 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Updated last year
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 5 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆15Updated 6 years ago
- This repo consists of all important concepts for data engineers.☆11Updated 4 months ago
- ☆18Updated last year
- Big Data Demystified meetup and blog examples☆31Updated 8 months ago
- Snowflake Cookbook, published by Packt☆79Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago
- Teaching notes from my Advanced SQL workshops as local lead instructor at General Assembly New York. The first edition was created for th…☆18Updated 5 years ago
- Complete Repository to become an expert is SQL Window Functions☆25Updated last year
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 4 years ago
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Simple ETL pipeline using Python☆26Updated last year
- Content related to Mastering Postgresql along with videos.☆15Updated 3 years ago
- ☆87Updated 2 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Updated 2 years ago
- ☆25Updated 4 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆98Updated 8 months ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆27Updated 2 years ago