sahilbhange / Facebook-Data-Extraction
#DataPipeLine #ETL - Created is a Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Graph API and Python to extract the data and loaded the data into the CSV files for further analysis.
☆14Updated 6 years ago
Related projects: ⓘ
- A curated list of awesome Databricks resources, including Spark☆14Updated 2 months ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- ☆26Updated 4 years ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Updated last year
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- Big Data Demystified meetup and blog examples☆31Updated last month
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆21Updated 2 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆15Updated 7 months ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution☆65Updated 4 years ago
- ☆11Updated this week
- Azure Data Engineering Cookbook 2nd-edition, published by Packt☆31Updated last year
- Repository for Microsoft Databricks Training Events - Hosted by BlueGranite☆14Updated 5 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆31Updated 4 years ago
- my personal working directory of milvus projects☆13Updated 6 months ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆28Updated 5 months ago
- Learn how to auto-ingest streaming data into Snowflake using Snowpipe.☆23Updated 2 years ago
- Data Engineering with Databricks Cookbook, published by Packt☆26Updated 3 months ago
- ☆14Updated this week
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆25Updated 4 years ago
- pyspark dataframe made easy☆15Updated 2 years ago
- Apache Spark using SQL☆14Updated 3 years ago
- Machine Learning in Snowflake☆24Updated 5 years ago
- AWS Big Data Certification☆24Updated last year
- CSV loader for Amazon Redshift.☆12Updated 5 years ago
- Collection of Databricks and Jupyter Notebooks☆22Updated 6 months ago
- This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field.☆12Updated 2 weeks ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆25Updated 2 years ago
- ☆15Updated 3 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆35Updated 2 months ago