pnavaro / big-dataLinks
Python tools for big data
☆53Updated last year
Alternatives and similar repositories for big-data
Users that are interested in big-data are comparing it to the libraries listed below
Sorting:
- Start a data science project with modern tools☆198Updated last year
- A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using th…☆115Updated 2 years ago
- pipreqs with jupyter notebook support☆70Updated 2 years ago
- Repository for the book Fast Python - published by Manning☆100Updated 2 months ago
- Phi_K correlation analyzer library☆164Updated last week
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated last year
- Sensible multi-core apply function for Pandas☆84Updated this week
- ☆12Updated 2 years ago
- This Repository contains the material for the tutorial "Introduction to MLOps with MLflow" held at pyData/pyCon Berlin 2022.☆23Updated 3 years ago
- ☆100Updated last week
- 💫 PyScaffold extension for data-science projects☆160Updated 3 weeks ago
- DataFrame support for scikit-learn.☆63Updated last week
- PyData London 2022 Tutorial☆66Updated 3 years ago
- Increase citations, ease review & collaboration A collection of "easy wins" to make machine learning in research reproducible. This tut…☆74Updated 7 months ago
- The binclass-tools package contains a set of Python wrappers and interactive plots that facilitate the analysis of binary classification …☆76Updated 2 years ago
- ☆123Updated last year
- How to Interpret SHAP Analyses: A Non-Technical Guide☆56Updated 3 years ago
- Wrap-up to automatically tune xgboost in Python.☆80Updated 3 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆52Updated 4 years ago
- This repository contains materials for AC295 fall 2020☆19Updated 4 years ago
- Talks about vaex☆36Updated 2 years ago
- Better heatmaps in Python☆136Updated 3 years ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.☆131Updated last year
- Automated Data Science and Machine Learning library to optimize workflow.☆104Updated 2 years ago
- Convert from Python script to Jupyter notebook and vice versa☆126Updated 11 months ago
- Clustergram - Visualization and diagnostics for cluster analysis in Python☆127Updated last week
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆47Updated 4 months ago
- The Data Science Interview Book☆36Updated 5 months ago
- Missing data amputation and exploration functions for Python☆71Updated 2 years ago
- In which I put together my thoughts on the practice of data science.☆298Updated last year