KjellSchubert / cheatsheets
personal cheatsheets on various technologies
☆25Updated 8 years ago
Alternatives and similar repositories for cheatsheets:
Users that are interested in cheatsheets are comparing it to the libraries listed below
- A collection of data science examples implemented across a variety of languages and libraries.☆33Updated 9 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 4 years ago
- Materials for Strata Singapore "Machine learning In Python with scikit-learn" tutorial.☆9Updated 9 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 8 years ago
- Apache Spark under Docker☆9Updated 8 years ago
- Apache Toree quickstart tutorial☆29Updated 9 years ago
- ☆15Updated 7 years ago
- Material for some talks I have given☆62Updated 6 months ago
- Apache Zeppelin on Kubernetes.☆28Updated 5 years ago
- Python client for ScienceOps☆29Updated 5 years ago
- Docker images for data science from Wise.io☆50Updated 9 years ago
- My talk at Strata 2014 in Santa Clara, CA☆73Updated 11 years ago
- Docker-izing Data Science Applications CodeLab for QCon AI 2018☆13Updated 7 years ago
- ☆9Updated 9 years ago
- ☆16Updated 7 years ago
- Alenka JDBC is a library for accessing and manipulating data with the open-source GPU database Alenka.☆19Updated 10 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 6 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Updated 9 years ago
- training material☆47Updated 5 months ago
- Materials for dask talk at PyData NYC☆15Updated 9 years ago
- Articles on Data Science, Jupyter, and Pandas☆18Updated 9 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- open source version of the Bonsai library☆26Updated 9 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- ☆24Updated 9 years ago
- Source code for 'Docker for Data Science' by Joshua Cook☆36Updated 7 years ago