madhup / libifyLinks
Import Databricks notebooks as libraries/modules
☆15Updated 3 years ago
Alternatives and similar repositories for libify
Users that are interested in libify are comparing it to the libraries listed below
Sorting:
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆102Updated 6 years ago
- Primrose modeling framework for simple production models☆33Updated last year
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 7 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆80Updated 2 years ago
- ☆96Updated 5 years ago
- Apache Avro <-> pandas DataFrame☆137Updated 5 months ago
- Distributed Bayesian Entity Resolution in Apache Spark☆59Updated 4 years ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆197Updated 6 years ago
- The Data Linter identifies potential issues (lints) in your ML training data.☆88Updated 8 years ago
- Public repository for versioning machine learning data☆42Updated 4 years ago
- Joblib Apache Spark Backend☆249Updated 10 months ago
- 🎯 kettle is a CLI tool for creating and deploying cloud functions & docker containers for machine learning☆31Updated 3 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆164Updated 7 months ago
- Embed categorical variables via neural networks.☆59Updated 2 years ago
- python library for automated dataset normalization☆117Updated 2 years ago
- PySpark phonetic and string matching algorithms☆41Updated last year
- Know your ML Score based on Sculley's paper☆34Updated 6 years ago
- Lightweight, Python library for fast and reproducible experimentation☆136Updated 7 years ago
- Useful decorators every Data Scientist should know☆29Updated 3 years ago
- General Interpretability Package☆58Updated 3 years ago
- Tools that make working with scikit-learn and pandas easier.☆44Updated 2 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆86Updated 2 years ago
- Read Delta tables without any Spark☆47Updated last year
- 🧮 Extended Latent Dirichlet Allocation for Collaborative Filtering in Recommender Systems.☆42Updated 3 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables"☆24Updated 6 years ago
- Python application to setup and run streaming (contextual) bandit experiments.☆83Updated 5 months ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Updated 2 years ago
- Record matching and entity resolution at scale in Spark☆36Updated 2 years ago
- A tool and library for easily deploying applications on Apache YARN☆146Updated last year
- a python grammar for evolutionary algorithms and heuristics☆192Updated 3 years ago