richardanaya / spark_delta_lake
☆16Updated 4 years ago
Alternatives and similar repositories for spark_delta_lake:
Users that are interested in spark_delta_lake are comparing it to the libraries listed below
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Some wrappers around python modules for simplifying the data exploration process.☆13Updated 5 months ago
- ☆25Updated 6 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- pyspark sample scripts☆17Updated 6 years ago
- ☆24Updated 6 years ago
- Dask tutorial for PyData DC 2016☆11Updated 8 years ago
- Tutorial repo for the article "ML in Production"☆30Updated 2 years ago
- Natural Language Processing with Spark's MLlib☆62Updated 7 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 9 years ago
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Analytics on Apache Projects for Diversity☆18Updated 5 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- Conversion utility from Zeppelin notes to Jupyter notebooks.☆44Updated 5 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- spark (scala and python)☆18Updated 5 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Code and notebooks for a talk given at PyBay, 2018-08-19☆48Updated 4 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- An example PySpark project with pytest☆16Updated 7 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 8 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- ☆20Updated 8 years ago
- Material for some talks I have given☆62Updated 7 months ago
- Workshop for Spark and Databricks☆54Updated 5 years ago
- Fast, accurate, lightweight, multi-core ML in Python, leveraging Vowpal Wabbit☆21Updated 6 years ago