madhup / libifyLinks
Import Databricks notebooks as libraries/modules
☆15Updated 3 years ago
Alternatives and similar repositories for libify
Users that are interested in libify are comparing it to the libraries listed below
Sorting:
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆102Updated 6 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆80Updated 2 years ago
- Primrose modeling framework for simple production models☆33Updated last year
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 2 years ago
- Read Delta tables without any Spark☆47Updated last year
- Export and import MLflow experiments, runs or registered models☆80Updated 3 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 7 years ago
- PySpark phonetic and string matching algorithms☆41Updated last year
- Projects developed by Domino's R&D team☆77Updated 3 years ago
- MLflow samples - deprecated☆22Updated 2 years ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆197Updated 6 years ago
- 🍦 Deployment tool for online machine learning models☆98Updated 3 years ago
- python library for automated dataset normalization☆117Updated 2 years ago
- Joblib Apache Spark Backend☆249Updated 10 months ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆164Updated 7 months ago
- Version Control for Machine Learning☆43Updated 3 years ago
- Tutorial for a new versioning Machine Learning pipeline☆80Updated 4 years ago
- Asynchronous actions for PySpark☆48Updated 4 years ago
- The Data Linter identifies potential issues (lints) in your ML training data.☆88Updated 8 years ago
- Machine learning evaluation database☆24Updated 7 years ago
- a python grammar for evolutionary algorithms and heuristics☆192Updated 3 years ago
- Test-Driven Data Analysis Functions☆302Updated this week
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆66Updated 8 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- 🧮 Extended Latent Dirichlet Allocation for Collaborative Filtering in Recommender Systems.☆42Updated 3 years ago
- ☆96Updated 5 years ago
- Public repository for versioning machine learning data☆42Updated 4 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 3 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆120Updated last year
- Create HTML profiling reports from Apache Spark DataFrames☆197Updated 6 years ago