Size of datasets used for analytics based on 10 years of surveys by KDnuggets.
☆16Nov 18, 2015Updated 10 years ago
Alternatives and similar repositories for dataset-sizes-kdnuggets
Users that are interested in dataset-sizes-kdnuggets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆27Nov 21, 2016Updated 9 years ago
- This app is my submission to the visualization contest held by Revolution Analytics.☆20Aug 29, 2014Updated 11 years ago
- Materials for a short introductory/intermediate Data Science course taught in the MSc in Business Analytics program at the Central Europe…☆33Sep 8, 2017Updated 8 years ago
- Kaggle scripts: R vs pydata + most popular R and Python packages for Machine Learning☆10Apr 13, 2017Updated 9 years ago
- Some examples of Yhat☆23Jun 11, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Google Analytics CMP for MODx Revolution☆22May 20, 2015Updated 10 years ago
- Compare the scoring speed of several open source machine learning libraries.☆19Jun 19, 2017Updated 8 years ago
- Most recent/important talks given at conferences/meetups☆14Nov 27, 2020Updated 5 years ago
- GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems☆20May 13, 2018Updated 7 years ago
- Tuning GBMs (hyperparameter tuning) and impact on out-of-sample predictions☆21Sep 11, 2017Updated 8 years ago
- Python client for ScienceOps☆29Oct 22, 2019Updated 6 years ago
- GBM intro talk (with R and Python code)☆17May 6, 2021Updated 4 years ago
- Estimating repeat spectra and genome length from low-coverage genome skims☆12Aug 6, 2023Updated 2 years ago
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream…☆31Jun 24, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python bindings for Matroid API☆17Aug 14, 2025Updated 8 months ago
- Machine Learning #1 and #2 courses at CEU Master of Science in Business Analytics☆22Feb 2, 2019Updated 7 years ago
- Minm Is Not Meta: One way to get several RMarkdown-using packages☆11Dec 20, 2025Updated 3 months ago
- split-apply-combine with optional collapsing groups☆12Jun 20, 2025Updated 9 months ago
- Course materials for Expert Data Wrangling with R. To purchase the videos or watch smaple lessons, visit http://shop.oreilly.com/product/…☆11Sep 14, 2015Updated 10 years ago
- R package for split test/one-armed bandit analysis☆16May 5, 2014Updated 11 years ago
- In-class exercises for Deep Learning course at NYC Data Science Academy☆33Feb 28, 2018Updated 8 years ago
- Notes and code for the workshop "Rule-Based Models for Regression and Classification”☆13May 21, 2016Updated 9 years ago
- getSymbols() reboot☆17Oct 17, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Youth Risk Behaviour Surveillance System Data☆13Feb 17, 2016Updated 10 years ago
- Models and Tools for Watershed Flux Estimates☆16Dec 6, 2024Updated last year
- Batch scoring script for making predictions☆33Sep 9, 2020Updated 5 years ago
- Book Hands on Machine Learning with Scikit-Learn and Tensorflow from O'reilly - Geron☆10May 11, 2017Updated 8 years ago
- R interface to POSIX mmap and Window's MapViewOfFile.☆17Apr 9, 2026Updated last week
- @Microsoft Data Camp (Analytics with Azure Machine Learning)☆26Mar 9, 2017Updated 9 years ago
- Quantitative analysis for traders on Oslo Stock Exchange. Download, plot and play with data from Oslo Børs and Nasdaq OMX☆10Jul 28, 2018Updated 7 years ago
- Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network)☆10Feb 5, 2019Updated 7 years ago
- ☆15Jan 23, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (inte…☆89Jul 25, 2017Updated 8 years ago
- Biochemical-free enrichment or depletion of RNA classes in real-time during direct RNA sequencing☆15Aug 27, 2025Updated 7 months ago
- Development and Errata for the book☆12May 6, 2013Updated 12 years ago
- SF DAT 22 Course Repository☆13Jun 3, 2016Updated 9 years ago
- An R package of utilities for benchmarking and optimization☆47Aug 11, 2023Updated 2 years ago
- A bundle of analytics tools for fisheries scientists☆14Mar 27, 2024Updated 2 years ago
- The R code compares the performance metrics between logistic regression, SVM, Naive Bayes, Knn and random forest classifers in a 10 fold …☆15Mar 13, 2016Updated 10 years ago