activescott / dbcexplode
Unpack the source files from a Databricks .dbc archive file.
☆23Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for dbcexplode
- AWS Big Data Certification☆25Updated last year
- MLflow App Library☆75Updated 5 years ago
- library for conducting propensity matching on spark scale☆14Updated last year
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- 📝 A blog post about report generation and automation in python☆40Updated 5 years ago
- Repository of Notebooks taken from https://neo4j.com/graph-algorithms-book/☆26Updated 4 years ago
- helpful resources for (big) data science☆33Updated 3 years ago
- Software Engineering Techniques Tutorial at SciPy 2018☆19Updated 6 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc☆48Updated last year
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- bamboolib - template for creating your own binder notebook☆21Updated 2 years ago
- ☆60Updated 2 years ago
- Workshop for Spark and Databricks☆54Updated 4 years ago
- Machine Learning Virtual Machine (provisioned with Vagrant) for building Spark Notebook applications☆54Updated 7 months ago
- ☆16Updated last year
- A simple introduction to using spark ml pipelines☆26Updated 6 years ago
- ☆11Updated 6 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Python library for deploying models built using Python to Alteryx Promote.☆16Updated 2 years ago
- Notebooks which will provide a demo of Qgrid functionality☆20Updated 4 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 8 years ago
- A simple Spark TDD example☆26Updated 7 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 5 years ago
- The Art of Data Science☆34Updated 5 years ago
- Managed Machine Learning Systems and Internet of Things Live Lesson☆39Updated 4 years ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- Code repository for Learning Apache Spark 2, published by Packt☆20Updated last year
- Articles on machine learning☆62Updated 2 years ago
- notebooks for nlp-on-spark☆13Updated 7 years ago