holdenk / diversity-analyticsLinks
Analytics on Apache Projects for Diversity
☆18Updated 5 years ago
Alternatives and similar repositories for diversity-analytics
Users that are interested in diversity-analytics are comparing it to the libraries listed below
Sorting:
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 8 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- (Deprecated) Task for the Search & Discovery data analyst job.☆21Updated 10 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Sharing interesting and noteworthy Data Engineering content☆68Updated 8 years ago
- Real-world Spark pipelines examples☆83Updated 7 years ago
- notebooks for nlp-on-spark☆13Updated 8 years ago
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆67Updated 9 years ago
- ☆11Updated 6 years ago
- Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/cour…☆43Updated last year
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- ☆16Updated 4 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- ☆8Updated 7 years ago
- tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark☆79Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- Make your libraries magically appear in Databricks.☆47Updated last year
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago
- ☆26Updated last year
- AWS Big Data Certification☆25Updated 4 months ago
- Know your ML Score based on Sculley's paper☆34Updated 6 years ago
- Some IPython notebooks I've created...☆29Updated 9 years ago
- Spoken dialogue querying for SQL databases.☆37Updated 8 years ago