sahilbhange / Facebook-Data-Extraction
#DataPipeLine #ETL - Created is a Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Graph API and Python to extract the data and loaded the data into the CSV files for further analysis.
☆14Updated 6 years ago
Alternatives and similar repositories for Facebook-Data-Extraction
Users that are interested in Facebook-Data-Extraction are comparing it to the libraries listed below
Sorting:
- Learn how to auto-ingest streaming data into Snowflake using Snowpipe.☆23Updated 2 years ago
- A curated list of awesome Databricks resources, including Spark☆18Updated 10 months ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 6 years ago
- Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project☆8Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Machine Learning in Snowflake☆24Updated 5 years ago
- Spark implementation of Slowly Changing Dimension type 2☆11Updated 6 years ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode…☆15Updated 6 years ago
- All the Snowflake Virtual Warehouse - Example☆12Updated 4 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- pyspark dataframe made easy☆16Updated 3 years ago
- Example Python and R code for Cloudera Machine Learning (CML) training☆14Updated 4 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- Cloned by the `dbt init` task☆61Updated last year
- Big Data Demystified meetup and blog examples☆31Updated 9 months ago
- Snowflake Cookbook, published by Packt☆79Updated 2 years ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution☆67Updated 4 years ago
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud…☆27Updated 3 weeks ago
- ☆14Updated 6 years ago
- Code snippets and tools published on the blog at lifearounddata.com☆12Updated 5 years ago
- ☆87Updated 2 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆25Updated 5 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆16Updated 6 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 6 years ago
- Scripts complement the Optimizing a Data Vault data warehouse on the Snowflake Cloud Data Platform webinar☆14Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up☆52Updated 4 years ago
- Spark app to merge different schemas☆23Updated 4 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 5 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆27Updated 2 years ago