A repository of Apache Spark projects, training projects, and tutorials, in both Scala and Python.
☆32Sep 15, 2021Updated 4 years ago
Alternatives and similar repositories for spark
Users that are interested in spark are comparing it to the libraries listed below
Sorting:
- simulation/RL - multi-agent car parking using reinforcement learning☆12Aug 4, 2024Updated last year
- remote log tail☆12Nov 2, 2017Updated 8 years ago
- Projects for Udacity Data Scientist Nanodegree☆11Dec 24, 2022Updated 3 years ago
- Sample app for a Python API using FastAPI and neomodel☆11Jul 1, 2024Updated last year
- Let's build an image filter app from scratch☆11Apr 13, 2023Updated 2 years ago
- A dataset of 'historical' data, useful for munging/ cleaning practice☆13Mar 12, 2018Updated 7 years ago
- Add .waitForUrl() to Nightmare☆10Aug 23, 2016Updated 9 years ago
- Proof of concept lambda for massive parallelism☆10Nov 2, 2018Updated 7 years ago
- Jupyter Notebook for PyData DC 2016 on "Making Your Code Faster: Cython and parallel processing in the Jupyter Notebook"☆12Nov 12, 2016Updated 9 years ago
- Linux Administration Bootcamp Go from Beginner to Advanced, published by Packt☆12Jan 30, 2023Updated 3 years ago
- ☆62Jan 9, 2024Updated 2 years ago
- "Not too complicated" training code for CIFAR-10 by PyTorch Lightning☆12Jun 5, 2022Updated 3 years ago
- A Python implementation of the card game Crazy Eights, adapted from "Hello World" by Warren and Carter Sande☆10May 28, 2017Updated 8 years ago
- ☆11Dec 2, 2016Updated 9 years ago
- The objective of Cloud Builders' Day repository is to provide do-it-yourself lab guides for several AWS services including but not limite…☆11Aug 20, 2020Updated 5 years ago
- ☆16Jul 10, 2019Updated 6 years ago
- A Gentle Introduction to Deep Learning with TensorFlow (PyCon 2017 talk)☆14May 30, 2017Updated 8 years ago
- Uses RNN on the Nietzsche dataset☆15May 28, 2017Updated 8 years ago
- ☆15Aug 11, 2022Updated 3 years ago
- An elegant HTTP/s Client library that helps you scale much better than requests☆20Apr 3, 2025Updated 11 months ago
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset☆20Sep 29, 2025Updated 5 months ago
- Notebooks for deep learning course☆14Jan 6, 2022Updated 4 years ago
- Azure Data Factory Cookbook_Second Edition, published by Packt☆19Feb 29, 2024Updated 2 years ago
- Repo of code for FP-Scanner article☆13May 30, 2018Updated 7 years ago
- Series around DevOps techniques for data platforms☆15Dec 7, 2022Updated 3 years ago
- This repository contains my solutions to the intermediate and advanced problems from the "SQL Practice Problems" book by Sylvia Moestl Va…☆14Jul 30, 2022Updated 3 years ago
- Exploration of timeseries LSTM RNN prediction in pytorch, keras, tensorflow, and tensorflow.contrib.keras.☆16Feb 12, 2018Updated 8 years ago
- Demo bot for Random Access Navigation☆13May 8, 2017Updated 8 years ago
- Demo code created for https://robkerr.ai☆17Dec 1, 2025Updated 3 months ago
- Nightmare plugin used to retrieve the network activity of a web page in HAR (HTTP Archive) format☆17Sep 14, 2017Updated 8 years ago
- Repo for Deep Learning Projects in NLP, GANs, Computer Vision☆19May 6, 2018Updated 7 years ago
- scikit-learn course for 2017 NGCM Summer Academy☆17Jun 30, 2017Updated 8 years ago
- Natural Language Processing Examples with python☆19May 30, 2019Updated 6 years ago
- Jupyter notebooks for "Fluent Python", by Luciano Ramalho☆14Jun 2, 2017Updated 8 years ago
- AWS Quick Start Team☆23Oct 3, 2024Updated last year
- Python interface for building rule-based expert systems over PyCLIPS☆14Nov 18, 2022Updated 3 years ago
- ☆51Sep 10, 2025Updated 5 months ago
- ☆21Nov 21, 2023Updated 2 years ago
- Python bindings for Stanford CoreNLP's protobufs.☆20Jul 23, 2018Updated 7 years ago