big-data-lab-team / paper-big-data-engines
A paper comparing Dask and Spark
☆10Updated 2 years ago
Alternatives and similar repositories for paper-big-data-engines
Users who are interested in paper-big-data-engines are comparing it to the libraries listed below.
- TileDB integrations for machine learning data and model i/o (PyTorch, TensorFlow, Scikit-Learn)☆25Updated 2 months ago
- Introduction to Dask for PyTorch Workflows☆13Updated 4 years ago
- The Baseline Site Selection Tool implements simulation tools for clinical trial enrollment.☆18Updated 2 years ago
- RAPIDS data science. No setup required.☆21Updated 4 years ago
- A place to provide Coiled feedback☆19Updated 2 months ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Pre-modelling analysis of the data through exploratory data analysis and statistical tests.☆51Updated last year
- Study notes and demos.☆12Updated last year
- Generate beautiful, testable documentation with Jupyter Notebooks☆21Updated 2 years ago
- Dask integration for Snowflake☆30Updated 6 months ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- A repository containing an introduction to Panel, made to support videos and talks.☆56Updated 3 years ago
- Minutes from nteract monthly contributor meeting; reports and metrics☆9Updated 3 years ago
- Repository with code, notebook and slides for my PyData & PyConDE talk 2023 about Clean Coding practices.☆12Updated 2 years ago
- Machine learning on electronic health records☆11Updated 7 years ago
- A status bar for JupyterLab☆49Updated 6 years ago
- knime-scripting includes scripting extensions for KNIME to integrate R, Matlab, Python and Groovy scripts. These extensions include a col…☆45Updated 5 months ago
- ☆22Updated 7 months ago
- A GitHub action to build data science environment images with repo2docker and push them to registries.☆145Updated 6 months ago
- Accompanies the uncool MLOps workshop☆26Updated 2 years ago
- Create testable, reproducible documentation with Jupyter notebooks☆51Updated 2 years ago
- JupyterLab renderer of dagitty causal diagrams☆24Updated last year
- Python package to visualize and cluster partial dependence.☆28Updated 3 years ago
- real-time data + ML pipeline☆54Updated this week
- Clinical NLP workshop for ODSC☆39Updated 5 years ago
- Customize 'react-toastify' to integrate nicely in JupyterLab.☆22Updated 2 years ago
- Cookiecutter template for testing Python scikit-learn clustering learners.☆16Updated 2 years ago
- Projects developed by Domino's R&D team☆76Updated 3 years ago
- Model-agnostic Statistical/Machine Learning explainability (currently Python) for tabular data☆9Updated last month
- Bidirectional communication for the HoloViz ecosystem☆34Updated last month