Assembly of fundamental statistics implemented based on Apache Spark
☆31Feb 11, 2016Updated 10 years ago
Alternatives and similar repositories for StatisticsOnSpark
Users that are interested in StatisticsOnSpark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark MLlib code optimized to efficiently support sparse data☆51Dec 22, 2016Updated 9 years ago
- Topic Modeling on Apache Spark☆94Mar 1, 2019Updated 7 years ago
- Yelp Restaurant Photo Classification - Kaggle competition☆12Apr 19, 2019Updated 7 years ago
- Winter Break Collaboratory DS Boot Camp during the academic year of 2017-2018☆14Feb 12, 2018Updated 8 years ago
- ☆21Dec 9, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Step-by-step Deep Leaning Tutorials on Apache Spark using BigDL☆210Jan 3, 2023Updated 3 years ago
- Links to example code downloads for Learning Path: Get Started with Natural Language Processing Using Python, Spark, and Scala☆17Feb 23, 2017Updated 9 years ago
- Gradient Boosting Enhanced with Step-Wise Feature Augmentation☆17Jan 13, 2021Updated 5 years ago
- Machine learning evaluation database☆24Feb 7, 2018Updated 8 years ago
- Another, hopefully better, implementation of ALS on Spark☆14May 20, 2015Updated 10 years ago
- Tensorflow implementation of a Neural Attention Model for Abstractive Summarization.☆10Jul 20, 2020Updated 5 years ago
- ☆20Dec 1, 2016Updated 9 years ago
- Cascading and Scalding wrapper for HBase with advanced read features☆54Feb 11, 2020Updated 6 years ago
- Pattern-of-Behavior Search Tool☆11Jun 20, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Word2Vec - Google's word2vec in Scala using UMASS factorie library for better hacking and research.☆16Apr 7, 2014Updated 12 years ago
- Glint: High performance scala parameter server☆170Jul 20, 2018Updated 7 years ago
- Dropwizard Metrics reporter for Apache Spark☆28Dec 22, 2014Updated 11 years ago
- ☆62Jul 11, 2019Updated 6 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation☆98Sep 21, 2011Updated 14 years ago
- Automatically exported from code.google.com/p/jbirch☆12Sep 6, 2022Updated 3 years ago
- JVM related exercises☆11Jul 16, 2017Updated 8 years ago
- Parallel ML System - STRADS scheduler☆30Oct 4, 2018Updated 7 years ago
- ☆20Nov 16, 2014Updated 11 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Memory consumption estimator for Scala/Java☆27Nov 24, 2014Updated 11 years ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Apr 12, 2022Updated 4 years ago
- A focused web crawler based on Playwright, RMQ, Kafka and Flink.☆14Feb 4, 2021Updated 5 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆26Aug 5, 2021Updated 4 years ago
- 《Redis 命令速查表》☆14Nov 6, 2017Updated 8 years ago
- ☆21Feb 9, 2023Updated 3 years ago
- Manual for RStudio Server☆16Oct 5, 2013Updated 12 years ago
- Simple UDF to split JSON arrays into Hive arrays☆10Jun 24, 2016Updated 9 years ago
- Cassandra river for Elastic search.☆37Jul 15, 2013Updated 12 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- All presentations from Data Fest Kyiv 2017 http://datafest.in.ua☆13Apr 24, 2017Updated 9 years ago
- Python scripts to facilitate easy working☆11Mar 23, 2026Updated last month
- Computer Science, Data Science and ML Fundamentals☆11May 30, 2025Updated 11 months ago
- This project is for the notebooks, code, and data for the "Vocabulary Analysis of Job Descriptions" tutorial at PyData 2017 Seattle☆20Jul 12, 2017Updated 8 years ago
- A Multi Layer Perceptron (MLP) Artificial Neural Network (ANN) Framework Developed in C for Machine Learning (ML) and Deep Learning (DL)☆11May 4, 2025Updated 11 months ago
- Group project for the WorldQuant University module, risk management.☆13Feb 3, 2019Updated 7 years ago
- Code for Springer Book: High Performance Distributed Computing: Case Studies with Hadoop, Scalding and Spark☆15Oct 6, 2017Updated 8 years ago