fugue-project / tutorialsLinks

Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask without any rewrites.

☆114

Alternatives and similar repositories for tutorials

Users that are interested in tutorials are comparing it to the libraries listed below

Sorting:

atoti / atoti
Notebook gallery and issue tracking for Atoti
☆227Updated 2 weeks ago
ibis-project / ibis-ml
IbisML is a library for building scalable ML pipelines using Ibis.
☆117Updated 4 months ago
getindata / kedro-kubeflow
Kedro Plugin to support running workflows on Kubeflow Pipelines
☆56Updated 5 months ago
canimus / cuallee
Possibly the fastest DataFrame-agnostic quality check library in town.
☆227Updated last month
fugue-project / tune
An abstraction layer for parameter tuning
☆35Updated 2 weeks ago
rasgointelligence / RasgoQL
Write python locally, execute SQL in your data warehouse
☆269Updated 3 years ago
alteryx / woodwork
Woodwork is a Python library that provides robust methods for managing and communicating data typing information.
☆155Updated 2 months ago
MrPowers / farsante
Fake Pandas / PySpark DataFrame creator
☆48Updated last year
MrPowers / beavis
Pandas helper functions
☆31Updated 2 years ago
ing-bank / popmon
Monitor the stability of a Pandas or Spark dataframe ⚙︎
☆509Updated 2 months ago
kedro-org / kedro-starters
Templates for your Kedro projects.
☆80Updated this week
ploomber / projects
Sample projects using Ploomber.
☆86Updated last year
anna-geller / prefect-deployment-patterns
Code examples showing flow deployment to various types of infrastructure
☆111Updated 2 years ago
tamsanh / kedro-great
The easiest way to integrate Kedro and Great Expectations
☆54Updated 2 years ago
jeppe742 / DeltaLakeReader
Read Delta tables without any Spark
☆47Updated last year
kedro-org / kedro-plugins
First-party plugins maintained by the Kedro team.
☆109Updated this week
danielbeach / tinytimmy
A simple and easy to use Data Quality (DQ) tool built with Python.
☆50Updated 2 years ago
drivendataorg / nbautoexport
Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.
☆83Updated 2 months ago
kedro-org / awesome-kedro
Plugins, extensions, case studies, articles, and video tutorials for Kedro
☆93Updated 11 months ago
Swiple / swiple
Swiple enables you to easily observe, understand, validate and improve the quality of your data
☆84Updated this week
poloclub / timbertrek
Explore and compare 1K+ accurate decision trees in your browser!
☆169Updated last year
dask-contrib / dask-snowflake
Dask integration for Snowflake
☆30Updated 3 months ago
MatsMoll / aligned
The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt
☆60Updated 2 weeks ago
great-expectations / great_expectations_action
A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.
☆81Updated last year
manoss96 / fluke
Cloud-agnostic Python API
☆60Updated last year
bytehub-ai / bytehub
ByteHub: making feature stores simple
☆61Updated 4 years ago
fal-ai / dbt_feature_store
Build your feature store with macros right within your dbt repository
☆39Updated 2 years ago
mitchelllisle / sparkdantic
✨ A Pydantic to PySpark schema library
☆112Updated last week
ploomber / soorgeon
Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊
☆79Updated last year
jacopotagliabue / paas-data-ingestion
Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner
☆71Updated 3 years ago