holdenk / diversity-analyticsLinks
Analytics on Apache Projects for Diversity
☆18Updated 6 years ago
Alternatives and similar repositories for diversity-analytics
Users that are interested in diversity-analytics are comparing it to the libraries listed below
Sorting:
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- CLI tool to launch Spark jobs on AWS EMR☆67Updated last year
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆67Updated 9 years ago
- tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark☆79Updated 7 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 6 years ago
- ☕⛵WIP PySpark dependency management☆22Updated 7 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆46Updated 2 months ago
- Sharing interesting and noteworthy Data Engineering content☆68Updated 8 years ago
- Coding exercises for Apache Spark☆104Updated 10 years ago
- An example PySpark project with pytest☆16Updated 7 years ago
- Snippets of code used in blog posts and other media.☆13Updated this week
- Open source Flotilla☆194Updated last month
- PySpark phonetic and string matching algorithms☆39Updated last year
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 9 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Conversion utility from Zeppelin notes to Jupyter notebooks.☆44Updated 5 years ago
- S3 backed ContentsManager for jupyter notebooks☆14Updated 9 years ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Events about the open source data stack☆13Updated 3 years ago
- Make your libraries magically appear in Databricks.☆47Updated last year
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 6 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- Some class materials for a data processing course using PySpark☆52Updated 2 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 8 years ago
- ☆15Updated 8 years ago