MuttData / muttlib
☆29 · Updated last year
Alternatives and similar repositories for muttlib:
Users who are interested in muttlib are comparing it to the libraries listed below.
- Comparing Polars to Pandas and a small introduction ☆43 · Updated 3 years ago
- Automated Jupyter notebook testing. 📙 ☆41 · Updated last year
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file. ☆15 · Updated last year
- Set-oriented Operations in Pandas ☆24 · Updated 4 years ago
- Sample projects using Ploomber. ☆86 · Updated last year
- Best practices for engineering ML pipelines. ☆35 · Updated 2 years ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow. ☆45 · Updated last month
- Open source bits of athenian-api. ☆19 · Updated last year
- A utility for labeling clusters of text data. ☆28 · Updated 3 years ago
- Pandas helper functions ☆30 · Updated 2 years ago
- ☆19 · Updated 4 years ago
- ☆26 · Updated 3 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model. ☆18 · Updated 6 years ago
- How to do data science with Optimus, Spark and Python. ☆19 · Updated 5 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊 ☆78 · Updated 7 months ago
- A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas. ☆59 · Updated 3 months ago
- Fast, resilient and reproducible data analysis with cached SQL queries ☆30 · Updated last year
- SciKIt-learn Pipeline in PAndas ☆42 · Updated last year
- mlctl is the control plane for MLOps. It provides a CLI and a Python SDK for supporting key operations related to MLOps, such as "model t… ☆25 · Updated 3 years ago
- The easiest way to integrate Kedro and Great Expectations ☆53 · Updated 2 years ago
- kedro plugin to automatically construct pipelines using pytest style pattern matching ☆21 · Updated last year
- Build your feature store with macros right within your dbt repository ☆38 · Updated 2 years ago
- PyCon Talks 2022 by Antoine Toubhans ☆23 · Updated 2 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included ☆28 · Updated 3 years ago
- Using the Parquet file format with Python ☆15 · Updated last year
- Instant search for and access to many datasets in Pyspark. ☆34 · Updated 2 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆84 · Updated last year
- Projects developed by Domino's R&D team ☆76 · Updated 3 years ago
- Interactive cleaning for Pandas DataFrames ☆15 · Updated 5 years ago
- Declarative layer for your database. ☆37 · Updated 2 years ago