StrikingLoo / dask-dataframe-benchmarking
A Jupyter Notebook I made to try out dask's Dataframes
☆27Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for dask-dataframe-benchmarking
- Tutorial covering a new workflow available going from pandas to scikit-learn☆40Updated last year
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆34Updated 4 years ago
- In which I implement some applications of machine learning techniques.☆30Updated 8 years ago
- Data Science for Good Projects☆49Updated 6 years ago
- ☆102Updated 6 years ago
- Python data science and machine learning from Ted Petrou with Dunder Data☆52Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆52Updated 4 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- Repository for an online class on Exploratory Data Analysis in Python☆65Updated 5 years ago
- ☆26Updated 5 years ago
- ☆47Updated 2 years ago
- ☆76Updated 6 years ago
- A repository filled with various data science projects.☆32Updated 4 years ago
- Extracting LinkedIn comments from any post and export it to Excel file☆23Updated 6 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆44Updated 2 weeks ago
- Interactive dashboard that show a decision support system to help DYCD/DOE’s award RFPs for the 2015 SONYC expansion.☆38Updated 2 years ago
- Contains code and presentation for my interactive hack session, 'Effective Feature Engineering: A Structured Approach to Building Better …☆29Updated 3 years ago
- Slides and code examples for H2O tutorials at various events☆56Updated 7 years ago
- Work for Mastering Large Datasets with Python☆18Updated last year
- Code for step-by-step tutorial recorded on January 27, 2018 about how to create web applications with Plotly Dash framework.☆15Updated 5 years ago
- ☆57Updated 5 years ago
- 📝 A blog post about report generation and automation in python☆40Updated 5 years ago
- ☆39Updated 7 years ago
- Tutorial given at PyData LA 2018☆97Updated 2 months ago
- Forecasting Uber demand in NYC neighborhoods☆34Updated 6 years ago
- helpful resources for (big) data science☆33Updated 3 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Springboard - Data Science Intensive course☆13Updated 7 years ago