ibis-project / ibis
the portable Python dataframe library
☆5,608 Updated this week
Alternatives and similar repositories for ibis:
Users interested in ibis are comparing it to the libraries listed below.
- A light-weight, flexible, and expressive statistical data testing library ☆3,688 Updated 2 weeks ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,055 Updated 6 months ago
- 📚 Parameterize, execute, and analyze notebooks ☆6,113 Updated 2 months ago
- Always know what to expect from your data. ☆10,273 Updated this week
- Parallel computing with task scheduling ☆13,045 Updated this week
- Fastest library to load data from DB to DataFrames in Rust and Python ☆2,190 Updated this week
- An orchestration platform for the development, production, and observation of data assets. ☆12,772 Updated this week
- Distributed data engine for Python/SQL designed for the cloud, powered by Rust ☆2,633 Updated this week
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v… ☆4,281 Updated this week
- NumPy and Pandas interface to Big Data ☆3,196 Updated last year
- Declarative visualization library for Python ☆9,651 Updated 2 weeks ago
- Efficient data transformation and modeling framework that is backwards compatible with dbt. ☆2,180 Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,356 Updated 5 months ago
- Computing with Python functions. ☆4,013 Updated this week
- Python SQL Parser and Transpiler ☆7,324 Updated this week
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️ ☆3,355 Updated this week
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ ☆3,557 Updated 6 months ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. ☆18,661 Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics ☆15,129 Updated this week
- A native Rust library for Delta Lake, with bindings into Python ☆2,621 Updated this week
- A Grammar of Graphics for Python ☆4,155 Updated this week
- Panel: The powerful data exploration & web app framework for Python ☆5,114 Updated this week
- cuDF - GPU DataFrame Library ☆8,776 Updated this week
- Clean APIs for data cleaning. Python implementation of R package Janitor ☆1,402 Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application… ☆10,554 Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting… ☆4,530 Updated last week
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts ☆6,783 Updated last week
- 🦆 A curated list of awesome DuckDB resources ☆1,646 Updated this week
- Python implementation of the parquet columnar file format. ☆817 Updated 4 months ago
- Dataframes powered by a multithreaded, vectorized query engine, written in Rust ☆32,499 Updated this week