Size of datasets used for analytics based on 10 years of surveys by KDnuggets.
☆16Nov 18, 2015Updated 10 years ago
Alternatives and similar repositories for dataset-sizes-kdnuggets
Users that are interested in dataset-sizes-kdnuggets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆27Nov 21, 2016Updated 9 years ago
- This app is my submission to the visualization contest held by Revolution Analytics.☆20Aug 29, 2014Updated 11 years ago
- Materials for a short introductory/intermediate Data Science course taught in the MSc in Business Analytics program at the Central Europe…☆33Sep 8, 2017Updated 8 years ago
- Kaggle scripts: R vs pydata + most popular R and Python packages for Machine Learning☆10Apr 13, 2017Updated 9 years ago
- Some examples of Yhat☆23Jun 11, 2014Updated 11 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Google Analytics CMP for MODx Revolution☆22May 20, 2015Updated 10 years ago
- Compare the scoring speed of several open source machine learning libraries.☆19Jun 19, 2017Updated 8 years ago
- Most recent/important talks given at conferences/meetups☆14Nov 27, 2020Updated 5 years ago
- Pexpect is a pure Python module for spawning child applications; controlling them; and responding to expected patterns in their output.☆38Oct 26, 2012Updated 13 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Apr 13, 2017Updated 9 years ago
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream…☆31Jun 24, 2016Updated 9 years ago
- Python bindings for Matroid API☆17Aug 14, 2025Updated 8 months ago
- Minm Is Not Meta: One way to get several RMarkdown-using packages☆11Dec 20, 2025Updated 4 months ago
- Course materials for Expert Data Wrangling with R. To purchase the videos or watch smaple lessons, visit http://shop.oreilly.com/product/…☆11Sep 14, 2015Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- R package for split test/one-armed bandit analysis☆16May 5, 2014Updated 12 years ago
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- csvcat☆22Feb 23, 2016Updated 10 years ago
- Store, append, read large lists in R without loading whole data into memory.☆14Apr 18, 2017Updated 9 years ago
- Notes and code for the workshop "Rule-Based Models for Regression and Classification”☆13May 21, 2016Updated 9 years ago
- getSymbols() reboot☆17Oct 17, 2024Updated last year
- Youth Risk Behaviour Surveillance System Data☆13Feb 17, 2016Updated 10 years ago
- Models and Tools for Watershed Flux Estimates☆16Dec 6, 2024Updated last year
- **Archived** CRAN Task View: Reproducible Research☆17Apr 4, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DEPRECATED repo for Manning book Deep Learning with Structured Data - please see https://github.com/ryanmark1867/deep_learning_for_struct…☆12May 17, 2020Updated 5 years ago
- Convolutional Neural Network for Click-Through Rate prediction.☆15Sep 28, 2016Updated 9 years ago
- Batch scoring script for making predictions☆33Sep 9, 2020Updated 5 years ago
- A computer vision dataset of vehicle logo segmentation masks. There are 34 logos each with 16 images and masks.☆14Aug 3, 2020Updated 5 years ago
- R interface to POSIX mmap and Window's MapViewOfFile.☆17Apr 9, 2026Updated last month
- Notes and code from a network biology study group at UMD.☆10Apr 11, 2015Updated 11 years ago
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Feb 23, 2015Updated 11 years ago
- @Microsoft Data Camp (Analytics with Azure Machine Learning)☆26Mar 9, 2017Updated 9 years ago
- R Package to stream and analyze tweets using a mongodb☆13Mar 1, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network)☆10Feb 5, 2019Updated 7 years ago
- ☆15Jan 23, 2024Updated 2 years ago
- A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (inte…☆89Jul 25, 2017Updated 8 years ago
- Development and Errata for the book☆12May 6, 2013Updated 13 years ago
- An R package of utilities for benchmarking and optimization☆47Aug 11, 2023Updated 2 years ago
- A bundle of analytics tools for fisheries scientists☆14Mar 27, 2024Updated 2 years ago
- The R code compares the performance metrics between logistic regression, SVM, Naive Bayes, Knn and random forest classifers in a 10 fold …☆15Mar 13, 2016Updated 10 years ago