code, labs and lectures for the course
☆48Apr 16, 2023Updated 3 years ago
Alternatives and similar repositories for architect_big_data_solutions_with_spark
Users that are interested in architect_big_data_solutions_with_spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Managing machine learning life-cycle with MLflow tutorial☆23May 1, 2023Updated 3 years ago
- A simple intro lab to Azure Databricks that gives users a flavour of both data engineering and data science with Azure Databricks.☆14Aug 8, 2018Updated 7 years ago
- Packt courseware source code for "Beginning Data Science with Jupyter"☆15Jan 5, 2020Updated 6 years ago
- This is part of the Artificial Intelligence live course, hosted by Packtpub. In this repository, you can find information to build your e…☆15Feb 19, 2019Updated 7 years ago
- ☆10Nov 29, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆37May 27, 2025Updated last year
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆12Jul 16, 2019Updated 6 years ago
- Labs to help you get started with Azure Data Factory☆33May 8, 2020Updated 6 years ago
- This project deals with vulnerability analysis and classification using machine learning techniques i.e. Natural Language Processing.☆10Feb 21, 2019Updated 7 years ago
- A ultra-lightweight 3D renderer of the Tensorflow/Keras neural network architectures☆19Oct 27, 2021Updated 4 years ago
- Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )☆93Jun 16, 2018Updated 7 years ago
- ☆16Jun 20, 2019Updated 6 years ago
- ☆53Updated this week
- Following "Pure React" book☆11Dec 1, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Programming of Simulation, Analysis, and Learning Systems Course Materials☆42Oct 27, 2025Updated 7 months ago
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆20Dec 28, 2021Updated 4 years ago
- Capstone project for Galvanize - Data Science Immersive. 'Project Plotline' looks at the emotional content of movie scripts (web scraping…☆16Sep 27, 2016Updated 9 years ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆18Jun 21, 2022Updated 3 years ago
- Azure Cosmos DB - Custom Point in Time Restore☆12Dec 7, 2022Updated 3 years ago
- My Git Repo for Csv Data☆21Oct 5, 2025Updated 7 months ago
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- A curated list of awesome Databricks resources, including Spark☆22Jun 28, 2024Updated last year
- Automatic Time Series Forecasting and Missing Values Imputation☆19Nov 4, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This reference architecture demonstrates the use of AWS Step Functions to orchestrate an Extract Transfer Load (ETL) workflow with AWS La…☆24Jun 16, 2020Updated 5 years ago
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorflo…☆13Aug 24, 2017Updated 8 years ago
- ☆10Jun 5, 2021Updated 4 years ago
- Here I show how to use Deep Learning for biological and biomedical Data Integration.☆11Sep 17, 2020Updated 5 years ago
- Framework to make bots based on Microsoft Bot Framework.☆13Oct 5, 2018Updated 7 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- library for conducting propensity matching on spark scale☆14Jun 27, 2023Updated 2 years ago
- Azure Data Factory hands-on lab, self-paced. Learn how to lift & shift SSIS packages to the Cloud with ADF. Build new ETL pipelines in AD…☆137Feb 4, 2024Updated 2 years ago
- Microsoft Ignite Learning Path, Train the Trainer materials: Modern Data Warehouse (Data)☆40Mar 19, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Oct 21, 2021Updated 4 years ago
- Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.☆15Sep 10, 2019Updated 6 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Nov 9, 2021Updated 4 years ago
- A JupyterLab extension for displaying dashboards of GPU usage.☆13Aug 24, 2023Updated 2 years ago
- This extension for Visual Studio code enables you to click on Angular selectors in HTML files and be redirected to their definition in th…☆14Jul 21, 2018Updated 7 years ago
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv…☆76Aug 25, 2025Updated 9 months ago
- Sqlite3-based logging for Python☆15May 27, 2024Updated 2 years ago