Repository for Apache Spark course at Team Data Science
☆17Oct 23, 2020Updated 5 years ago
Alternatives and similar repositories for learning-apache-spark
Users that are interested in learning-apache-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for the Document streaming capstone projects☆12Nov 17, 2025Updated 6 months ago
- Course Material Data Engineering on AWS Course☆31Sep 9, 2024Updated last year
- ☆15Jul 1, 2021Updated 4 years ago
- Sample Project to Learn Data Engineering☆10Aug 1, 2021Updated 4 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11May 22, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)☆15Dec 16, 2021Updated 4 years ago
- Workshop for 2020 Apache Beam Summit: using Beam to build data pipelines for deep learning.☆11Aug 24, 2020Updated 5 years ago
- ☆17Nov 12, 2022Updated 3 years ago
- Data sets and ML models versioning example from DVC get started☆10Jun 4, 2024Updated 2 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataform☆13Jun 6, 2023Updated 3 years ago
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆23Sep 19, 2022Updated 3 years ago
- A collection of Demos for Google Cloud Databases☆19Dec 4, 2025Updated 6 months ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Jan 30, 2023Updated 3 years ago
- Challenge Data Engineer☆25Jun 13, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Labs and demos for courses in the Data Engineer track of GCP Training (http://cloud.google.com/training).☆15Oct 28, 2019Updated 6 years ago
- Webflow by Example, published by Packt☆12Jan 24, 2023Updated 3 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆27Jul 23, 2020Updated 5 years ago
- ☆14Feb 19, 2024Updated 2 years ago
- Using Time Series Forecasting , we can study the pattern of energy Consumptionin in a general household , which can predict the estimated…☆14Oct 18, 2023Updated 2 years ago
- This is a simple Python library for interacting with the REST interface of an instance of Cordra☆10May 20, 2022Updated 4 years ago
- ☆19Apr 21, 2021Updated 5 years ago
- ☆31Dec 26, 2025Updated 5 months ago
- Vim Python Extension☆16Oct 10, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- data and code for scrapping and cleaning data on covid-19 in India from https://www.mohfw.gov.in/ and https://www.covid19india.org/☆43Oct 1, 2020Updated 5 years ago
- In which I implement some applications of machine learning techniques.☆32May 10, 2016Updated 10 years ago
- Starter Code for BNR React Testing Workshop☆12Apr 18, 2023Updated 3 years ago
- A self-contained, queryable knowledge graph of tech skills and IT stuff; maintained with git☆18Nov 14, 2023Updated 2 years ago
- Pipeline for processing JWST imaging data, tailored for nearby galaxies. Built for PHANGS☆21Updated this week
- ☆12Apr 21, 2021Updated 5 years ago
- This repository contains the data and the code associated to the paper "Hyper-cores promote localization and efficient seeding in higher-…☆12Oct 6, 2023Updated 2 years ago
- Implementing RAG with Amazon Bedrock, Amazon Titan, and Amazon OpenSearch Serverless☆11Oct 9, 2023Updated 2 years ago
- Visually query Spanner Graph data in notebooks☆40Apr 16, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆38Jul 18, 2023Updated 2 years ago
- 🔌 Flask S3Viewer is a powerful extension that makes it easy to browse S3 in any Flask application. (Python S3 Uploader / Flask S3 Upload…☆13May 20, 2026Updated 3 weeks ago
- Interactive didactic simulation of a Hopfield network, a type of neural network that models associative memory.☆16Feb 20, 2021Updated 5 years ago
- Examples of using IBM data prep kit☆32Nov 20, 2025Updated 6 months ago
- Automating Your Data Pipeline with Apache Airflow☆40Sep 1, 2023Updated 2 years ago
- This is a repository for the LinkedIn Learning course Practical Python for Data Professionals☆50Jun 12, 2024Updated 2 years ago
- ☆17Sep 5, 2023Updated 2 years ago