hi-primus / bumblebee
A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
☆141 · Updated last year
Alternatives and similar repositories for bumblebee:
Users who are interested in bumblebee are comparing it to the libraries listed below.
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆124 · Updated 3 years ago
- Type System for Data Analysis in Python ☆212 · Updated 3 months ago
- Notebook storage and publishing workflows for the masses ☆202 · Updated 3 years ago
- Data pipelines from re-usable components ☆108 · Updated 2 years ago
- A web frontend for scheduling Jupyter notebook reports ☆252 · Updated 5 months ago
- Beneath is a serverless real-time data platform ⚡️ ☆84 · Updated 3 years ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information. ☆153 · Updated last week
- Notebook sharing hub ☆498 · Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 · Updated last year
- MLOps simplified. One-stop AI delivery platform, all the features you need. ☆98 · Updated this week
- manipulate pandas dataframes from the comfort of your browser ☆171 · Updated 3 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations). ☆122 · Updated 3 weeks ago
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in… ☆255 · Updated 11 months ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration ☆35 · Updated 4 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆200 · Updated last week
- ☆27 · Updated 3 months ago
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common fun… ☆216 · Updated 3 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆79 · Updated 2 weeks ago
- Build, test, deploy, iterate - Dev and prod tool for data science pipelines ☆59 · Updated 2 years ago
- A library for recording and reading data in notebooks. ☆289 · Updated 3 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows. ☆57 · Updated 3 years ago
- Write python locally, execute SQL in your data warehouse ☆269 · Updated 2 years ago
- python library for automated dataset normalization ☆114 · Updated last year
- A bit of extra usability for sqlalchemy v2. ☆77 · Updated 11 months ago
- OlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning ex… ☆51 · Updated last year
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆27 · Updated 2 years ago
- Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications,… ☆285 · Updated 2 months ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks ☆21 · Updated 2 years ago
- A Python library for working with Table Schema. ☆263 · Updated 5 months ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps. ☆78 · Updated 9 months ago