big-data-lab-team / paper-big-data-engines
A paper comparing Dask and Spark
☆11 Updated 2 years ago
Alternatives and similar repositories for paper-big-data-engines:
Users who are interested in paper-big-data-engines are comparing it to the libraries listed below
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟 ☆53 Updated 3 years ago
- Instant search for and access to many datasets in PySpark. ☆34 Updated 2 years ago
- Pre-modelling analysis of the data through exploratory data analysis and statistical tests. ☆50 Updated last year
- This repo is an approach to TDD in machine learning model operation. It covers project structure, testing essentials using pytest with Gi… ☆15 Updated 4 years ago
- Pandas helper functions ☆30 Updated last year
- The Baseline Site Selection Tool implements simulation tools for clinical trial enrollment. ☆18 Updated 2 years ago
- bamboolib - a GUI for pandas dataframes. Stop googling pandas commands ☆28 Updated 4 years ago
- RAPIDS data science. No setup required. ☆20 Updated 3 years ago
- Get introduced to Directed Acyclic Graphs (DAGs) through Dagster with a simple ML program ☆12 Updated last year
- JupyterLab Extension for dependency management and optimization ☆16 Updated last year
- Open Targets Library ETL Pipeline | Apache Beam ☆16 Updated 3 years ago
- A place to provide Coiled feedback ☆15 Updated 6 months ago
- ☆19 Updated 3 years ago
- Scaling Python Machine Learning ☆45 Updated last year
- Repository of Notebooks taken from https://neo4j.com/graph-algorithms-book/ ☆26 Updated 4 years ago
- Source code of ME2Vec. ☆14 Updated last year
- Notebooks for the ML Link Prediction Course ☆14 Updated 4 years ago
- Spark NLP for Streamlit ☆15 Updated 3 years ago
- The documentation for the Clustergrammer project ☆10 Updated 4 years ago
- real-time data + ML pipeline ☆54 Updated this week
- Projects developed by Domino's R&D team ☆76 Updated 2 years ago
- A work-in-progress book on Dask ☆12 Updated last year
- A repository containing an introduction to Panel, made to support videos and talks. ☆56 Updated 3 years ago
- JupyterLab renderer of dagitty causal diagrams ☆20 Updated last year
- Demo of DuckDB with dashboarding tools: Evidence, Streamlit, and Rill ☆14 Updated last year
- ☆22 Updated 4 months ago
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models. ☆21 Updated 2 years ago