richardanaya / spark_delta_lake
☆16Updated 4 years ago
Alternatives and similar repositories for spark_delta_lake:
Users that are interested in spark_delta_lake are comparing it to the libraries listed below
- A simple introduction to using spark ml pipelines☆26Updated 6 years ago
- Some wrappers around python modules for simplifying the data exploration process.☆13Updated 3 months ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- Analytics on Apache Projects for Diversity☆18Updated 5 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 8 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Conversion utility from Zeppelin notes to Jupyter notebooks.☆44Updated 5 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Updated 9 years ago
- Know your ML Score based on Sculley's paper☆34Updated 5 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 7 years ago
- Workshop for Spark and Databricks☆54Updated 5 years ago
- ☆29Updated 8 years ago
- Dask tutorial for PyData DC 2016☆11Updated 8 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Test suite to document the behavior of Spark☆21Updated 3 years ago
- ETL data pipeline for SixFifty modelling & analytics☆13Updated 5 years ago
- Simple validator for submissions to DrivenData competitions☆19Updated 5 years ago
- An example PySpark project with pytest☆17Updated 7 years ago
- Articles on Data Science, Jupyter, and Pandas☆18Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- The ultimate twitter streaming data collector☆40Updated 8 years ago
- ☆24Updated 6 years ago
- ☆34Updated 8 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 6 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- Updated 9 years ago