darenasc / aeda
Build a data catalog by running a single line of code
☆17Updated 2 months ago
Alternatives and similar repositories for aeda
Users who are interested in aeda are comparing it to the libraries listed below
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them☆26Updated last year
- Repo demonstrating a Dagster pipeline to generate a Neo4j graph☆21Updated 4 years ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆35Updated 4 years ago
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypotheses, and visualizing results in a web browser☆33Updated 2 years ago
- Using the Parquet file format with Python☆15Updated last year
- Declarative layer for your database.☆37Updated 2 years ago
- A Delta Lake reader for Dask☆49Updated 7 months ago
- A monorepo of many Rill example projects☆36Updated 2 weeks ago
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Updated 2 months ago
- Demo converting the Streamlit Uber NYC rides app to use DuckDB☆29Updated 2 years ago
- Talk "Beyond pandas: The great Python dataframe showdown"☆37Updated 2 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆77Updated last year
- A software engineering framework to jump start your machine learning projects☆37Updated 10 months ago
- ☆29Updated last year
- A markdown wiki and dashboarding system for Datasette☆21Updated 3 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆32Updated 3 years ago
- Datamallet is a Python library which contains several helper functions and modules for the common tasks in a typical data science workflow…☆11Updated 2 years ago
- Versatile Metrics Collection for Python☆19Updated last year
- A data wrangling and modeling tool.☆63Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Evaluation Matrix for Change Data Capture☆25Updated 9 months ago
- Investigation for PyDataLondon 2023 and ODSC 2023 conference comparing Pandas 2, Polars and Dask☆11Updated last year
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last year
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆17Updated 10 months ago
- Explore Crime in Toronto by Neighbourhood.☆12Updated last year
- data wrangling simplicity, complete audit transparency, and at speed☆34Updated 2 months ago
- A Python library bakeoff for medium-sized datasets☆24Updated last year