sahilbhange / Facebook-Data-ExtractionLinks
#DataPipeLine #ETL - Created is a Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Graph API and Python to extract the data and loaded the data into the CSV files for further analysis.
☆14Updated 6 years ago
Alternatives and similar repositories for Facebook-Data-Extraction
Users that are interested in Facebook-Data-Extraction are comparing it to the libraries listed below
Sorting:
- Learn how to auto-ingest streaming data into Snowflake using Snowpipe.☆23Updated 2 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 6 years ago
- A curated list of awesome Databricks resources, including Spark☆19Updated 11 months ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆27Updated 2 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project☆8Updated 5 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Updated last year
- Big Data Demystified meetup and blog examples☆31Updated 9 months ago
- dbt / Amazon Redshift Demonstration Project☆34Updated 2 years ago
- Tableau 10 Business Intelligence Cookbook by Packt☆13Updated 2 years ago
- Udacity Data Engineering Nano Degree Project, Data Modeling for fact and dimension tables, and ETL pipeline that transfers data from file…☆9Updated 4 years ago
- Simple samples for writing ETL transform scripts in Python☆22Updated 3 years ago
- Project based learning for Data Engineering fundamentals.☆13Updated 4 years ago
- Basic tutorial of using Apache Airflow☆36Updated 6 years ago
- Profiles the data, validates the schema and runs data quality checks and produces a report☆20Updated 6 years ago
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆20Updated 3 years ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution☆67Updated 4 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆26Updated 4 years ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Complete Repository to become an expert is SQL Window Functions☆25Updated last year
- ☆14Updated 6 years ago
- My Git Repo for Csv Data☆21Updated 4 years ago
- A repository for materials used in Snowflake fundamentals bootcamp at O'Reilly Learning Platform☆13Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆11Updated last year
- Spark app to merge different schemas☆23Updated 4 years ago
- Snowflake - Build and Architect Data Pipelines using AWS, published by Packt☆21Updated 2 years ago
- ☆64Updated last week
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Updated 2 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago