hi-primus / bumblebee
π A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
β140Updated last year
Related projects β
Alternatives and complementary repositories for bumblebee
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.β122Updated 3 years ago
- Data pipelines from re-usable componentsβ106Updated last year
- Type System for Data Analysis in Pythonβ209Updated 3 months ago
- Tool to automate data quality checks on data pipelinesβ249Updated 2 years ago
- π Notebook storage and publishing workflows for the massesβ203Updated 3 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.β78Updated last week
- A frictionless integrated platform for notebookβ85Updated last year
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.β147Updated last week
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).β121Updated 6 months ago
- DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.β198Updated last week
- SQL interface to Pandasβ51Updated 2 years ago
- Automated Exploratory Data Analysis. Simplifying Data Explorationβ34Updated 4 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withouβ¦β111Updated 7 months ago
- The Data Explorer is nteract's automatic visualization tool.β104Updated last year
- MLOps simplified. One platform, all the functionality you need. Swiss madeβ95Updated last week
- Build and deploy a serverless data pipeline on AWS with no effort.β110Updated last year
- A web frontend for scheduling Jupyter notebook reportsβ251Updated 2 years ago
- Primrose modeling framework for simple production modelsβ34Updated 8 months ago
- β27Updated this week
- Fast iterative local development and testing of Apache Airflow workflowsβ193Updated 5 months ago
- manipulate pandas dataframes from the comfort of your browserβ172Updated 3 years ago
- Python package for deduplication/entity resolution using active learningβ78Updated 2 months ago
- python library for automated dataset normalizationβ112Updated last year
- A Python package that parses SQL and interprets it as methods that act upon existing pandas (or other types of) DataFrames that have beenβ¦β98Updated 3 years ago
- A Python DB-API and SQLAlchemy dialect to Google Spreasheetsβ213Updated last year
- Write python locally, execute SQL in your data warehouseβ270Updated 2 years ago
- An extention to json_normalize() in pandasβ27Updated 4 years ago
- ByteHub: making feature stores simpleβ58Updated 3 years ago