hi-primus / bumblebee
π A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
β140Updated last year
Alternatives and similar repositories for bumblebee:
Users that are interested in bumblebee are comparing it to the libraries listed below
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.β123Updated 3 years ago
- π Notebook storage and publishing workflows for the massesβ203Updated 3 years ago
- Data pipelines from re-usable componentsβ108Updated last year
- Type System for Data Analysis in Pythonβ210Updated 5 months ago
- Beneath is a serverless real-time data platform β‘οΈβ84Updated 2 years ago
- A web frontend for scheduling Jupyter notebook reportsβ252Updated last month
- The Data Explorer is nteract's automatic visualization tool.β104Updated 2 years ago
- plait.py - a fake data modelerβ433Updated 6 years ago
- A library for recording and reading data in notebooks.β285Updated 2 years ago
- T4 is now in production as Quilt 3β64Updated 5 years ago
- βοΈ Parallel and distributed training with spaCy and Rayβ53Updated last year
- MLOps simplified. One platform, all the functionality you need. Swiss madeβ97Updated last month
- A bit of extra usability for sqlalchemy v2.β77Updated 7 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withouβ¦β113Updated 9 months ago
- Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications,β¦β285Updated 5 months ago
- Primrose modeling framework for simple production modelsβ33Updated 9 months ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browserβ33Updated last year
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (inβ¦β256Updated 7 months ago
- manipulate pandas dataframes from the comfort of your browserβ172Updated 3 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.β105Updated 2 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.β79Updated 3 weeks ago
- A frictionless integrated platform for notebookβ85Updated 2 years ago
- β39Updated 5 years ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.β151Updated this week
- KNOTS is an intuitive desktop application built to simplify the configuration of Singer pipelinesβ67Updated last year
- Repo demonstrating a Dagster pipeline to generate Neo4j Graphβ21Updated 3 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows.β57Updated 3 years ago
- A browser user interface for manual labeling of record pairs.β42Updated last year