nebari-dev / big-data-tutorial
🧑🏫 Practical guide to big data analysis, with Python
☆23Updated last year
Alternatives and similar repositories for big-data-tutorial
Users that are interested in big-data-tutorial are comparing it to the libraries listed below
- 📖 Documentation for Nebari☆16Updated 2 weeks ago
- IPython magic for parallel profiling (like `%time`, but parallel)☆72Updated 8 years ago
- Cluster tools for running Dask on Databricks☆14Updated last year
- ☆89Updated 7 months ago
- Simple markdown changelogs for GitHub repositories☆53Updated 2 weeks ago
- A Zoo for decorators☆26Updated 3 weeks ago
- For when your data won't fit in your dataframe☆48Updated last month
- Extension to hypothesis for testing numpy general universal functions☆39Updated 4 years ago
- Use pathlib syntax to easily work with Pandas series containing file paths.☆70Updated 2 years ago
- RFC document, tooling and other content related to the dataframe API standard☆109Updated last year
- A pytest plugin for regression testing and regenerating Jupyter Notebooks☆52Updated last week
- ☆86Updated last month
- ☆85Updated 11 months ago
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary☆15Updated 5 years ago
- A Python library for manipulating indices of ndarrays☆107Updated last week
- JupyterLab UI Testing Framework☆31Updated 3 years ago
- Application creator and general launcher for JupyterHub☆38Updated this week
- Construct, deconstruct, convert, execute, and prepare slides from Jupyter notebooks☆34Updated 2 months ago
- A `select` accessor for easier subsetting of pandas DataFrames and Series☆34Updated 2 years ago
- Robust statistics in Python☆67Updated 2 months ago
- The Pandata scalable open-source analysis stack☆68Updated last year
- Tool to merge environment files of the conda package manager☆58Updated 8 months ago
- "Hacking Dask" tutorial materials☆71Updated 4 years ago
- Python packaging made simple. Recommendations & guidance curated by the pyOpenSci community☆129Updated last week
- ⚡️ An efficient cache for the execution of dask graphs.☆71Updated last year
- A three-hour tutorial on property-based testing with https://hypothesis.works☆59Updated last year
- Bidirectional communication for the HoloViz ecosystem☆34Updated last month
- Rethinking machine learning pipelines☆32Updated 9 months ago
- dataframe visualiser☆17Updated 6 years ago
- Call Jupyter notebooks as Python functions☆56Updated 8 months ago