activescott / dbcexplode
Unpack the source files from a Databricks .dbc archive file.
☆26Updated 11 months ago
Alternatives and similar repositories for dbcexplode:
Users that are interested in dbcexplode are comparing it to the libraries listed below
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- ☆59Updated 3 years ago
- Repository of Notebooks taken from https://neo4j.com/graph-algorithms-book/☆26Updated 5 years ago
- 📝 A blog post about report generation and automation in python☆40Updated 5 years ago
- helpful resources for (big) data science☆33Updated 3 years ago
- AWS Big Data Certification☆25Updated 2 months ago
- MLFlow Spark Summit 2019 Presentation☆67Updated 5 years ago
- Quickstart PySpark with Anaconda on AWS/EMR☆53Updated 8 years ago
- Software Engineering Techniques Tutorial at SciPy 2018☆19Updated 6 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- library for conducting propensity matching on spark scale☆14Updated last year
- Using Kafka-Python to illustrate a ML production pipeline☆109Updated 2 years ago
- MLOps simplified. One platform, all the functionality you need. Swiss made☆98Updated this week
- Distributed Deep Learning using AzureML☆41Updated 5 years ago
- DevOps for AI project using Azure Databricks, Azure DevOps and Azure Machine Learning Service☆16Updated 3 years ago
- Using Apache Airflow to author, run and monitor complex data pipelines.☆12Updated 6 years ago
- Capturing model drift and handling its response - Example webinar☆107Updated 5 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆43Updated last month
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- Python library for deploying models built using Python to Alteryx Promote.☆15Updated 3 years ago
- Python bindings for the Domino APIs☆53Updated 3 weeks ago
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl☆71Updated 2 years ago
- Explore tips and tricks to deploy machine learning models with Docker.☆13Updated last year
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- A Jupyter Notebook I made to try out dask's Dataframes☆27Updated 6 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆25Updated 5 years ago
- A frictionless integrated platform for notebook☆85Updated 2 years ago
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 5 years ago
- Modern Techniques for Data Science with Big Datasets