nebari-dev / big-data-tutorial
🧑🏫 Practical guide to big data analysis, with Python
☆22 Updated 11 months ago
Alternatives and similar repositories for big-data-tutorial
Users that are interested in big-data-tutorial are comparing it to the libraries listed below
- 📖 Documentation for Nebari ☆16 Updated last week
- IPython magic for parallel profiling (like `%time`, but parallel) ☆72 Updated 7 years ago
- A pytest plugin for regression testing and regenerating Jupyter Notebooks ☆52 Updated 2 weeks ago
- ☆89 Updated 5 months ago
- Use pathlib syntax to easily work with Pandas series containing file paths. ☆69 Updated 2 years ago
- Cluster tools for running Dask on Databricks ☆14 Updated last year
- Hatch plugin for conda environments ☆41 Updated last year
- ☆84 Updated 9 months ago
- For when your data won't fit in your dataframe ☆46 Updated 2 months ago
- Rethinking machine learning pipelines ☆31 Updated 7 months ago
- Application creator and general launcher for JupyterHub ☆37 Updated 2 weeks ago
- Material for Inside Dask talk | PyData DC | August 2021 ☆13 Updated 3 years ago
- Simple markdown changelogs for GitHub repositories ☆52 Updated last week
- Comparing Polars to Pandas and a small introduction ☆44 Updated 4 years ago
- [DEPRECATED] Use setup-micromamba instead ☆74 Updated 2 years ago
- Streaming and approximate algorithms. WIP, use at own risk. ☆27 Updated 6 months ago
- The Pandata scalable open-source analysis stack ☆68 Updated last year
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary ☆15 Updated 5 years ago
- Time based splits for cross validation ☆38 Updated 3 weeks ago
- Advanced algorithms for xarray ☆38 Updated 3 months ago
- Schema validation for Xarray objects ☆42 Updated 2 months ago
- A JupyterLab gallery for presenting and downloading examples ☆10 Updated 8 months ago
- "Hacking Dask" tutorial materials ☆71 Updated 4 years ago
- Extremely lightweight compatibility layer between pandas and Polars ☆41 Updated last year
- Tool for encapsulating, running, and reproducing projects with Conda environments ☆31 Updated last month
- animate your data to life ☆28 Updated 2 years ago
- dataframe visualiser ☆17 Updated 5 years ago
- Build a tested, sphinx-based website from notebooks ☆31 Updated last week
- A `select` accessor for easier subsetting of pandas DataFrames and Series ☆34 Updated 2 years ago
- Bidirectional communication for the HoloViz ecosystem ☆34 Updated last week