jpzk / cookiecutter-scala-spark
A cookiecutter template for Apache Spark applications written in Scala
☆10 Updated 5 years ago
Related projects
Alternatives and complementary repositories for cookiecutter-scala-spark
- High Level Kafka Scanner ☆19 Updated 7 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy" ☆15 Updated 8 years ago
- Collection of dask example notebooks ☆57 Updated 6 years ago
- feng - feature engineering for machine-learning champions ☆27 Updated 7 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆21 Updated 8 years ago
- Materials for dask talk at PyData NYC ☆15 Updated 9 years ago
- Natural Language Processing with Spark's MLlib ☆62 Updated 7 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data. ☆13 Updated 8 years ago
- Analysis pipeline for quick ML analyses. ☆11 Updated 6 years ago
- Proposals for new Jupyter subprojects to enter into incubation ☆18 Updated 4 years ago
- Migrate an SQL db to Neo4j graph db ☆34 Updated 4 years ago
- An example project for doing grid search in MLlib ☆13 Updated 9 years ago
- ☆26 Updated 10 months ago
- Tools for performing hyperparameter search with Scikit-Learn and Dask http://dask-searchcv.readthedocs.io ☆11 Updated 7 years ago
- Articles on Data Science, Jupyter, and Pandas ☆18 Updated 9 years ago
- Notes on Lambda Architecture ☆12 Updated 6 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013) ☆55 Updated 2 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- python library for interacting with SolrCloud ☆36 Updated 3 years ago
- Assembly of fundamental statistics implemented based on Apache Spark ☆31 Updated 8 years ago
- Docker compose files for various kafka stacks ☆33 Updated 6 years ago
- PySpark for Elastic Search ☆55 Updated 7 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 6 years ago
- Using Pandas easily with Cassandra ☆23 Updated 6 years ago
- Spark Tutorial at the University of Maryland ☆38 Updated 10 years ago
- Code and setup information for Introduction to Machine Learning with Spark ☆12 Updated 9 years ago
- Geo-Located Data: Extracting Patterns from Mobile Data using Scikit-Learn and Cassandra ☆30 Updated 6 years ago
- Fast, easy and intuitive machine learning prototyping. ☆124 Updated 10 years ago