ploomber / jupysql
Better SQL in Jupyter. π
β693Updated last week
Related projects: β
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β1,968Updated last month
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)β860Updated this week
- A curated list of Polars talks, tools, examples & articles. Contributions welcome !β702Updated this week
- Efficient data transformation and modeling framework that is backwards compatible with dbt.β1,612Updated this week
- Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lineβ¦β661Updated 4 months ago
- Lightweight and extensible compatibility layer between dataframe libraries!β392Updated this week
- data load tool (dlt) is an open source Python library that makes data loading easy π οΈβ2,307Updated this week
- Distributed DataFrame for Python designed for the cloud, powered by Rustβ2,080Updated this week
- Turning PySpark Into a Universal DataFrame APIβ277Updated this week
- Monte Carlo simulation of the NBA season, leveraging dbt, duckdb and evidence.devβ421Updated this week
- SQLAlchemy driver for DuckDBβ336Updated this week
- Python Stream Processingβ1,467Updated this week
- Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadaβ¦β1,723Updated this week
- Making data lake work for time seriesβ1,121Updated 3 weeks ago
- Dagster Labs' open-source data platform, built with Dagster.β270Updated this week
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.β1,008Updated 2 months ago
- Monitor the stability of a Pandas or Spark dataframe βοΈβ493Updated 2 months ago
- Automatically profile dataframes in the Jupyter sidebarβ340Updated 7 months ago
- Apache DataFusion Python Bindingsβ346Updated last week
- A Postgres Proxy Server in Pythonβ225Updated this week
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamiltonβ863Updated last year
- π¦ A curated list of awesome DuckDB resourcesβ1,251Updated this week
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.β1,639Updated this week
- Build and share data reports in 100% Pythonβ1,369Updated 11 months ago
- Recipes for using Python's polars libraryβ243Updated last week
- The fastest β‘οΈ way to build data pipelines. Develop iteratively, deploy anywhere. βοΈβ3,479Updated 2 months ago
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wrβ¦β1,779Updated this week
- PyAirbyte brings the power of Airbyte to every Python developer.β205Updated this week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.ioβ1,866Updated last week
- Distributed SQL Engine in Python using Daskβ385Updated 3 weeks ago