pnavaro / big-data
Python tools for big data
☆53 Updated 2 years ago
Alternatives and similar repositories for big-data
Users who are interested in big-data compare it to the libraries listed below.
- Phi_K correlation analyzer library ☆167 Updated 2 weeks ago
- Start a data science project with modern tools ☆202 Updated 2 years ago
- A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using th… ☆117 Updated 2 years ago
- pipreqs with jupyter notebook support ☆70 Updated 2 years ago
- PyData London 2022 Tutorial ☆67 Updated 3 years ago
- How to Interpret SHAP Analyses: A Non-Technical Guide ☆56 Updated 3 years ago
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 Updated 7 months ago
- Repository for the book Fast Python - published by Manning ☆110 Updated 5 months ago
- Wrap-up to automatically tune xgboost in Python. ☆80 Updated 4 years ago
- Templates for jupyter notebooks ☆147 Updated last year
- ☆72 Updated 2 years ago
- Repository for a workshop on Bayesian Decision Analysis ☆73 Updated 2 years ago
- 💫 PyScaffold extension for data-science projects ☆158 Updated 2 weeks ago
- Sensible multi-core apply function for Pandas ☆88 Updated 3 weeks ago
- ☆150 Updated 2 years ago
- Increase citations, ease review & collaboration. A collection of "easy wins" to make machine learning in research reproducible. This tut… ☆75 Updated 3 weeks ago
- Jupyter Widget for Lux ☆76 Updated 2 years ago
- Easy-to-run example notebooks for Dask ☆381 Updated last year
- Source for the PyViz.org website. ☆182 Updated 2 weeks ago
- ☆93 Updated last month
- Clustergram - Visualization and diagnostics for cluster analysis in Python ☆128 Updated 3 weeks ago
- Python port of "Common statistical tests are linear models" by Jonas Kristoffer Lindeløv. ☆95 Updated last year
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆84 Updated last year
- An abstraction layer for parameter tuning ☆35 Updated last year
- This repository contains the material for the tutorial "Introduction to MLOps with MLflow" held at PyData/PyCon Berlin 2022. ☆22 Updated 3 years ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions ☆312 Updated 6 months ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer. ☆133 Updated last year
- Data Structures and Information Retrieval in Python ☆133 Updated last year
- Explore and compare 1K+ accurate decision trees in your browser! ☆169 Updated last year
- In which I put together my thoughts on the practice of data science. ☆303 Updated 2 years ago