pnavaro / big-dataLinks
Python tools for big data
☆53Updated 2 years ago
Alternatives and similar repositories for big-data
Users that are interested in big-data are comparing it to the libraries listed below
Sorting:
- A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using th…☆119Updated 3 years ago
- Phi_K correlation analyzer library☆170Updated last week
- Start a data science project with modern tools☆203Updated 2 years ago
- Templates for jupyter notebooks☆147Updated last year
- Data Analysis Baseline Library☆133Updated last year
- PyData London 2022 Tutorial☆68Updated 3 years ago
- ☆123Updated last year
- pipreqs with jupyter notebook support☆71Updated 2 years ago
- 💫 PyScaffold extension for data-science projects☆158Updated 2 weeks ago
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆48Updated 9 months ago
- In which I put together my thoughts on the practice of data science.☆303Updated 2 years ago
- Sensible multi-core apply function for Pandas☆88Updated 3 weeks ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆86Updated last year
- Talks about vaex☆36Updated 3 years ago
- DataFrame support for scikit-learn.☆63Updated 3 months ago
- Python library that represents empirical distribution functions.☆173Updated 6 months ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆53Updated 5 years ago
- Repository for a workshop on Bayesian Decision Analysis☆74Updated 2 years ago
- ☆102Updated 3 months ago
- Altair backend for pandas plotting☆104Updated 4 years ago
- Adding timestamps to NumFOCUS and PyData YouTube videos!☆104Updated 3 years ago
- Repository for the book Fast Python - published by Manning☆114Updated 7 months ago
- Wrap-up to automatically tune xgboost in Python.☆81Updated 4 years ago
- A curated list of Python libraries used for data science.☆93Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆84Updated 3 months ago
- Tool for whitebox (binning + logreg) model development☆77Updated 3 years ago
- Get started DVC project (NLP, random forest)☆189Updated last year
- Increase citations, ease review & collaboration A collection of "easy wins" to make machine learning in research reproducible. This tut…☆75Updated last month
- Clustergram - Visualization and diagnostics for cluster analysis in Python☆127Updated 2 months ago