j-bennet / talks
Code snippets to use in talks
☆20 Updated 2 years ago
Alternatives and similar repositories for talks
Users who are interested in talks compare it to the repositories listed below
- A short tutorial for data scientists on how to write tests for code + data. ☆120 Updated 4 years ago
- Quickstart PySpark with Anaconda on AWS/EMR ☆53 Updated 8 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework ☆84 Updated 9 years ago
- Magic functions for using Jupyter Notebook with Apache Spark and a variety of SQL databases. ☆171 Updated 6 years ago
- Course materials for my data pipeline video course with O'Reilly ☆198 Updated 7 years ago
- Create HTML profiling reports from Apache Spark DataFrames ☆196 Updated 5 years ago
- A luigi powered analytics / warehouse stack ☆88 Updated 8 years ago
- ☆22 Updated 7 years ago
- Example Python DS project ☆71 Updated 6 years ago
- Unit and integration testing with PySpark can be tough to figure out; let's make that easier. ☆22 Updated 9 years ago
- Python automatic data quality check toolkit ☆283 Updated 4 years ago
- A fork of the cookiecutter-data-science leveraging Docker for local development. ☆131 Updated 5 years ago
- A simple Spark TDD example ☆26 Updated 7 years ago
- Reproducible Data Analysis Workflow in Jupyter ☆118 Updated 6 years ago
- ☆86 Updated 6 years ago
- Materials for "Docker for Data Science" tutorial presented at PyCon 2018 in Cleveland, OH ☆154 Updated 4 years ago
- edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab ☆224 Updated 5 years ago
- ☆49 Updated 6 years ago
- Material for Talk at PyData Seattle 2017 ☆168 Updated 7 years ago
- ☆135 Updated 5 years ago
- Start a cluster in EC2 for dask.distributed ☆106 Updated 4 years ago
- ☆29 Updated 8 years ago
- Using Project Jupyter for data science. ☆258 Updated 4 years ago
- Open source Flotilla ☆193 Updated this week
- Summarise and explore Pandas DataFrames ☆98 Updated 4 years ago
- Render sparkline style charts in pandas dataframes ☆93 Updated 4 years ago
- A library for defensive data analysis. ☆500 Updated 5 years ago
- A web frontend for scheduling Jupyter notebook reports ☆253 Updated 6 months ago
- PyData NYC 2017: Pandas Head to Tail ☆57 Updated 7 years ago
- A Getting Started Guide for developing and using Airflow Plugins ☆93 Updated 6 years ago