nebari-dev / big-data-tutorial
🧑‍🏫 Practical guide to big data analysis, with Python
☆22 · Updated 9 months ago
Alternatives and similar repositories for big-data-tutorial:
Users who are interested in big-data-tutorial are comparing it to the libraries listed below.
- Cluster tools for running Dask on Databricks ☆13 · Updated 11 months ago
- 📖 Documentation for Nebari ☆16 · Updated this week
- A pytest plugin for regression testing and regenerating Jupyter Notebooks ☆52 · Updated this week
- Simple markdown changelogs for GitHub repositories ☆52 · Updated 3 weeks ago
- Hatch plugin for conda environments ☆40 · Updated 11 months ago
- Material for Inside Dask talk | PyData DC | August 2021 ☆13 · Updated 3 years ago
- Rethinking machine learning pipelines ☆30 · Updated 5 months ago
- For when your data won't fit in your dataframe ☆44 · Updated 3 weeks ago
- ☆89 · Updated 3 months ago
- A JupyterLab gallery for presenting and downloading examples ☆10 · Updated 7 months ago
- The Pandata scalable open-source analysis stack ☆68 · Updated 11 months ago
- [DEPRECATED] Use setup-micromamba instead ☆74 · Updated last year
- "Hacking Dask" tutorial materials ☆71 · Updated 3 years ago
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary ☆15 · Updated 5 years ago
- Advanced algorithms for xarray ☆37 · Updated 2 months ago
- Animate your data to life ☆28 · Updated last year
- Extremely lightweight compatibility layer between pandas and Polars ☆41 · Updated last year
- xarray data creation by data classes ☆78 · Updated 4 months ago
- IPython magic for parallel profiling (like `%time`, but parallel) ☆71 · Updated 7 years ago
- Generate conda environment.yml from PEP 621 and/or flit config. ☆10 · Updated 3 years ago
- Dataframe visualiser ☆17 · Updated 5 years ago
- Streaming and approximate algorithms. WIP, use at own risk. ☆26 · Updated 4 months ago
- Schema validation for Xarray objects ☆42 · Updated last month
- Ibis tutorial repository ☆32 · Updated 10 months ago
- Use pathlib syntax to easily work with Pandas series containing file paths. ☆69 · Updated last year
- Conda packages from flit information ☆10 · Updated 3 years ago
- Construct, deconstruct, convert, execute, and prepare slides from Jupyter notebooks ☆34 · Updated this week
- ☆85 · Updated 8 months ago
- Time based splits for cross validation ☆38 · Updated last week
- RFC document, tooling and other content related to the dataframe API standard ☆108 · Updated last year