hi-primus / bumblebee
A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
★141 · Updated last year
Alternatives and similar repositories for bumblebee
Users who are interested in bumblebee are comparing it to the libraries listed below.
- Data pipelines from re-usable components ★108 · Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ★126 · Updated 3 years ago
- Notebook storage and publishing workflows for the masses ★202 · Updated 3 years ago
- Type System for Data Analysis in Python ★212 · Updated 4 months ago
- A web frontend for scheduling Jupyter notebook reports ★253 · Updated 6 months ago
- The Data Explorer is nteract's automatic visualization tool. ★107 · Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ★113 · Updated last year
- Automated Exploratory Data Analysis. Simplifying Data Exploration ★36 · Updated 4 years ago
- T4 is now in production as Quilt 3 ★64 · Updated 6 years ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps. ★79 · Updated 10 months ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations). ★121 · Updated last month
- A bit of extra usability for sqlalchemy v2. ★77 · Updated last year
- plait.py - a fake data modeler ★434 · Updated 6 years ago
- Beneath is a serverless real-time data platform ⚡️ ★84 · Updated 3 years ago
- A library for recording and reading data in notebooks. ★290 · Updated 3 years ago
- A frictionless integrated platform for notebook ★85 · Updated 2 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ★79 · Updated 2 weeks ago
- Primrose modeling framework for simple production models ★32 · Updated last year
- Fast iterative local development and testing of Apache Airflow workflows ★201 · Updated last month
- Notebook sharing hub ★500 · Updated last year
- Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications,… ★285 · Updated 3 months ago
- Dockerfiles for images used as part of the Orbyter toolset ★44 · Updated last year
- MLOps simplified. One-stop AI delivery platform, all the features you need. ★99 · Updated this week
- Tool to automate data quality checks on data pipelines ★255 · Updated 2 years ago
- A python client library for the Stitch Import API ★42 · Updated last year
- A JupyterLab extension providing SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino ★88 · Updated 2 months ago
- Nuclio Function Automation for Python and Jupyter ★89 · Updated 6 months ago
- manipulate pandas dataframes from the comfort of your browser ★171 · Updated 3 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows. ★57 · Updated 3 years ago
- A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels f… ★505 · Updated 2 months ago