sahilbhange / Facebook-Data-Extraction
#DataPipeLine #ETL - Created is a Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Graph API and Python to extract the data and loaded the data into the CSV files for further analysis.
☆14Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Facebook-Data-Extraction
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- A curated list of awesome Databricks resources, including Spark☆14Updated 4 months ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Learn how to auto-ingest streaming data into Snowflake using Snowpipe.☆23Updated 2 years ago
- 3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow☆12Updated 5 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆26Updated 2 years ago
- Big Data Demystified meetup and blog examples☆31Updated 3 months ago
- This repository contains code to build an MVP search engine with google like interface.☆16Updated 4 years ago
- Learning Google BigQuery, published by Packt☆14Updated last year
- Azure Data Engineering Cookbook 2nd-edition, published by Packt☆31Updated last year
- Spark app to merge different schemas☆23Updated 3 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- Simple samples for writing ETL transform scripts in Python☆22Updated 3 years ago
- Simple ETL pipeline using Python☆21Updated last year
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆15Updated 9 months ago
- ☆14Updated 5 years ago
- Codeless Deep Learning with KNIME☆13Updated last year
- Public source code for the Batch Processing with Apache Beam (Python) online course☆19Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up☆44Updated 3 years ago
- This repository contains example implementations for KNIME Analytics Platform.☆16Updated 3 months ago
- Machine Learning in Snowflake☆24Updated 5 years ago
- Source code for 'BigQuery for Data Warehousing' by Mark Mucchetti☆15Updated 4 years ago
- Scripts complement the Optimizing a Data Vault data warehouse on the Snowflake Cloud Data Platform webinar☆14Updated 4 years ago
- BigQuery Schema Conversion Tool☆23Updated 4 years ago
- Project based learning for Data Engineering fundamentals.☆13Updated 3 years ago
- build dw with dbt☆29Updated 3 weeks ago
- ☆26Updated 4 years ago
- Code for my "Efficient Data Processing in SQL" book.☆50Updated 3 months ago
- A Postgres data warehouse for processing synthetic data using IAC principles☆16Updated last year