oneoffcoder / py-pair
Pairwise association measures of statistical variable types
☆21Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for py-pair
- R-like formula approach to Spark Dataframes☆10Updated 3 years ago
- A collection of pedantic docker containers.☆30Updated last year
- A collection of online books for data science, computer science and coding!☆70Updated 7 months ago
- Inference in Bayesian Belief Networks using Probability Propagation in Trees of Clusters (PPTC) and Gibbs sampling☆55Updated 8 months ago
- A project that creates a VM with single node setup of Hadoop v2.3.0 with YARN installed.☆19Updated 10 years ago
- A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and …☆37Updated 2 years ago
- Record matching and entity resolution at scale in Spark☆31Updated last year
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆64Updated 9 months ago
- An abstraction layer for parameter tuning☆36Updated 2 months ago
- ☆22Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 7 months ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- TSForecasting: Automated Time Series Forecasting Framework☆26Updated last week
- Elo ratings for time-series forecasting packages☆23Updated 2 years ago
- ☆21Updated 10 months ago
- Render Jupyter Notebooks With Metaflow Cards☆24Updated last month
- Playing with Python Bluesky SDK☆13Updated this week
- A set of utilities to quicky analyze time series.☆22Updated 3 years ago
- ☆20Updated 3 years ago
- ElasticSearch implementation of MlFlow tracking store☆16Updated 4 years ago
- ☆28Updated last month
- Process, visualize and use data easily.☆20Updated last year
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated 11 months ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- ☆13Updated last year
- Templates for your Kedro projects.☆67Updated this week
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech.☆0Updated last year
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.☆13Updated 3 years ago
- Orchestrate Modal and OpenAI workloads with Dagster☆12Updated last month