jpzk / cookiecutter-scala-sparkLinks
A cookiecutter template for Apache Spark applications written in Scala
☆10Updated 6 years ago
Alternatives and similar repositories for cookiecutter-scala-spark
Users that are interested in cookiecutter-scala-spark are comparing it to the libraries listed below
Sorting:
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- Notes on Lambda Architecture☆12Updated 7 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- A quick start project for polyaxon☆29Updated 10 months ago
- Code reference from my Qbox blog posts.☆87Updated 9 years ago
- TAC is an airflow plugin which helps you to Extract transform and Load your data, bit more easily☆9Updated 7 years ago
- Using Pandas easily with Cassandra☆23Updated 7 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 4 years ago
- Aho-Corasick string replacement utility☆24Updated 5 years ago
- Spark Parameter Optimization and Tuning☆31Updated 7 years ago
- ☆16Updated 8 years ago
- Python Streaming Pipelines with Beam on Flink - Demo☆14Updated 2 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Updated 11 years ago
- ☆26Updated last year
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 10 years ago
- Affinity Propagation on Spark☆19Updated 4 years ago
- Analysis pipeline for quick ML analyses.☆11Updated 6 years ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Spark Application UI extension for JupyterLab☆10Updated 3 years ago
- dllib is a distributed deep learning library running on Apache Spark☆32Updated 7 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- An API for Distributed Machine Learning☆155Updated 8 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago