Size of datasets used for analytics based on 10 years of surveys by KDnuggets.
☆16Nov 18, 2015Updated 10 years ago
Alternatives and similar repositories for dataset-sizes-kdnuggets
Users that are interested in dataset-sizes-kdnuggets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆27Nov 21, 2016Updated 9 years ago
- This app is my submission to the visualization contest held by Revolution Analytics.☆20Aug 29, 2014Updated 11 years ago
- Kaggle scripts: R vs pydata + most popular R and Python packages for Machine Learning☆10Apr 13, 2017Updated 8 years ago
- A Google Analytics CMP for MODx Revolution☆22May 20, 2015Updated 10 years ago
- Compare the scoring speed of several open source machine learning libraries.☆19Jun 19, 2017Updated 8 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Tuning GBMs (hyperparameter tuning) and impact on out-of-sample predictions☆21Sep 11, 2017Updated 8 years ago
- Python client for ScienceOps☆29Oct 22, 2019Updated 6 years ago
- GBM intro talk (with R and Python code)☆17May 6, 2021Updated 4 years ago
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream…☆31Jun 24, 2016Updated 9 years ago
- Python bindings for Matroid API☆17Aug 14, 2025Updated 7 months ago
- Machine Learning #1 and #2 courses at CEU Master of Science in Business Analytics☆22Feb 2, 2019Updated 7 years ago
- Minm Is Not Meta: One way to get several RMarkdown-using packages☆11Dec 20, 2025Updated 3 months ago
- split-apply-combine with optional collapsing groups☆12Jun 20, 2025Updated 9 months ago
- R package for split test/one-armed bandit analysis☆16May 5, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- csvcat☆22Feb 23, 2016Updated 10 years ago
- Store, append, read large lists in R without loading whole data into memory.☆14Apr 18, 2017Updated 8 years ago
- Notes and code for the workshop "Rule-Based Models for Regression and Classification”☆13May 21, 2016Updated 9 years ago
- getSymbols() reboot☆17Oct 17, 2024Updated last year
- Youth Risk Behaviour Surveillance System Data☆13Feb 17, 2016Updated 10 years ago
- Models and Tools for Watershed Flux Estimates☆16Dec 6, 2024Updated last year
- **Archived** CRAN Task View: Reproducible Research☆17Apr 4, 2022Updated 3 years ago
- Batch scoring script for making predictions☆33Sep 9, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Book Hands on Machine Learning with Scikit-Learn and Tensorflow from O'reilly - Geron☆10May 11, 2017Updated 8 years ago
- A computer vision dataset of vehicle logo segmentation masks. There are 34 logos each with 16 images and masks.☆13Aug 3, 2020Updated 5 years ago
- Notes and code from a network biology study group at UMD.☆10Apr 11, 2015Updated 10 years ago
- R interface to POSIX mmap and Window's MapViewOfFile.☆17Jan 30, 2026Updated last month
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Feb 23, 2015Updated 11 years ago
- Youtube companion (https://www.youtube.com/watch?v=1Mt7EuVJf1A&feature=youtu.be) - Brief introduction to the SMOTE R package to super-sam…☆11Aug 19, 2014Updated 11 years ago
- R Package to stream and analyze tweets using a mongodb☆13Mar 1, 2016Updated 10 years ago
- Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network)☆10Feb 5, 2019Updated 7 years ago
- ☆15Jan 23, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (inte…☆89Jul 25, 2017Updated 8 years ago
- Development and Errata for the book☆12May 6, 2013Updated 12 years ago
- SF DAT 22 Course Repository☆13Jun 3, 2016Updated 9 years ago
- An R package of utilities for benchmarking and optimization☆47Aug 11, 2023Updated 2 years ago
- Docker container to make running Luigi tasks real easy.☆11Aug 31, 2016Updated 9 years ago
- The R code compares the performance metrics between logistic regression, SVM, Naive Bayes, Knn and random forest classifers in a 10 fold …☆15Mar 13, 2016Updated 10 years ago
- R package for solving cone constrained convex optimization problems.☆18Sep 27, 2025Updated 6 months ago