MuttData / muttlib
β30Updated last year
Alternatives and similar repositories for muttlib:
Users that are interested in muttlib are comparing it to the libraries listed below
- Comparing Polars to Pandas and a small introductionβ43Updated 3 years ago
- πΎ PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.β15Updated last year
- Automated Jupyter notebook testing. πβ41Updated last year
- Set-oriented Operations in Pandasβ24Updated 4 years ago
- sample code for tech blog post "Porting Flask to FastAPI for ML Model Serving"β29Updated last year
- Build your feature store with macros right within your dbt repositoryβ38Updated 2 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API πβ53Updated 3 years ago
- Automated Exploratory Data Analysis. Simplifying Data Explorationβ34Updated 4 years ago
- Python Data Collection Libraryβ45Updated 3 years ago
- Sample projects using Ploomber.β85Updated last year
- Blog post on ETL pipelines with Airflowβ23Updated 4 years ago
- β19Updated 3 years ago
- β26Updated 2 years ago
- Pandas helper functionsβ30Updated last year
- A powerful data analysis package based on mathematical step functions. Strongly aligned with pandas.β59Updated 3 weeks ago
- dagster scikit-learn pipeline example.β44Updated last year
- A utility for labeling clusters of text data.β28Updated 3 years ago
- datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforestβ¦β58Updated 3 years ago
- Convert monolithic Jupyter notebooks π into maintainable Ploomber pipelines. πβ78Updated 4 months ago
- A framework-agnostic datasets library for Machine Learning research and education.β18Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withouβ¦β113Updated 10 months ago
- Cloud-agnostic Python APIβ61Updated 7 months ago
- Interactive cleaning for Pandas DataFramesβ15Updated 5 years ago
- kedro plugin to automatically construct pipelines using pytest style pattern matchingβ21Updated last year
- β17Updated last year
- A very simple "hello world" project for deploying Prefect 2 to a docker container on Google Compute Engine.β11Updated 2 years ago
- PyCon Talks 2022 by Antoine Toubhansβ23Updated 2 years ago
- Material for Talk Python Training course on Getting Started with Dask.β28Updated 2 years ago
- Instant search for and access to many datasets in Pyspark.β34Updated 2 years ago
- DataHub on AWS demonstration resourcesβ10Updated 2 years ago