nebari-dev / big-data-tutorial
🧑🏫 Practical guide to big data analysis, with Python
☆23 Updated last year
Alternatives and similar repositories for big-data-tutorial
Users who are interested in big-data-tutorial are comparing it to the libraries listed below.
- Use pathlib syntax to easily work with Pandas series containing file paths. ☆70 Updated last week
- IPython magic for parallel profiling (like `%time`, but parallel) ☆72 Updated 8 years ago
- 📖 Documentation for Nebari ☆16 Updated last week
- ☆89 Updated 7 months ago
- A pytest plugin for regression testing and regenerating Jupyter Notebooks ☆52 Updated this week
- ☆86 Updated last week
- For when your data won't fit in your dataframe ☆48 Updated 2 months ago
- dataframe visualiser ☆17 Updated 6 years ago
- ☆84 Updated last year
- Material for Inside Dask talk | PyData DC | August 2021 ☆13 Updated 3 years ago
- Simple markdown changelogs for GitHub repositories ☆53 Updated this week
- Extension to hypothesis for testing numpy general universal functions ☆39 Updated 4 years ago
- Property-based testing tutorial ☆59 Updated 2 years ago
- Rethinking machine learning pipelines ☆32 Updated 9 months ago
- Build a tested, sphinx-based website from notebooks ☆31 Updated 2 weeks ago
- Notes and experiments in Jupyter dashboarding ☆16 Updated 4 years ago
- animate your data to life ☆28 Updated 2 years ago
- Streaming and approximate algorithms. WIP, use at own risk. ☆27 Updated last week
- Bidirectional communication for the HoloViz ecosystem ☆34 Updated 2 months ago
- The Pandata scalable open-source analysis stack ☆68 Updated last year
- Python package implementing transformers for preprocessing steps for machine learning. ☆64 Updated last week
- A three-hour tutorial on property-based testing with https://hypothesis.works ☆59 Updated last year
- "Hacking Dask" tutorial materials ☆71 Updated 4 years ago
- Hatch plugin for conda environments ☆42 Updated last year
- A `select` accessor for easier subsetting of pandas DataFrames and Series ☆34 Updated 2 years ago
- Cluster tools for running Dask on Databricks ☆14 Updated last year
- Application creator and general launcher for JupyterHub ☆40 Updated last week
- The purpose of this repository is to make it as easy as possible to develop and use awesome Panel extensions. ☆57 Updated last year
- JupyterLab UI Testing Framework ☆31 Updated 4 years ago
- RFC document, tooling and other content related to the dataframe API standard ☆108 Updated last year