intel-spark/StatisticsOnSpark

intel-spark / StatisticsOnSpark

Assembly of fundamental statistics implemented based on Apache Spark

☆31

Alternatives and similar repositories for StatisticsOnSpark

Users who are interested in StatisticsOnSpark are comparing it to the repositories listed below.

intel-spark / SparseML
Spark MLlib code optimized to efficiently support sparse data
☆51 · Dec 22, 2016 · Updated 9 years ago
intel-spark / TopicModeling
Topic Modeling on Apache Spark
☆94 · Mar 1, 2019 · Updated 7 years ago
DS-BootCamp-DSI-Columbia / AY2017-2018-Winter-Collaboratory
Winter Break Collaboratory DS Boot Camp during the academic year of 2017-2018
☆14 · Feb 12, 2018 · Updated 8 years ago
jamartinh / Orange3-Spark
A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML
☆15 · Dec 24, 2016 · Updated 9 years ago
carpedm20 / practice-tensorflow
☆21 · Dec 9, 2015 · Updated 10 years ago
augboost-anon / augboost
Gradient Boosting Enhanced with Step-Wise Feature Augmentation
☆17 · Jan 13, 2021 · Updated 5 years ago
zaleslaw / Spark-Tutorial
How to build your first Spark application with MLlib, StructuredStreaming, GraphFrames, Datasets and so on? Answer is here!
☆53 · Nov 5, 2019 · Updated 6 years ago
1202kbs / MemN2N-Tensorflow
Implementation of End-To-End Memory Networks with Tensorflow for bAbI Dataset
☆11 · Aug 17, 2017 · Updated 8 years ago
fabiopetroni / libfm_with_BPR
☆20 · Dec 1, 2016 · Updated 9 years ago
levarml / LeVar
Machine learning evaluation database
☆24 · Feb 7, 2018 · Updated 8 years ago
elephantscale / learning-scala
☆14 · Aug 24, 2021 · Updated 4 years ago
mengxr / spark-als
Another, hopefully better, implementation of ALS on Spark
☆14 · May 20, 2015 · Updated 11 years ago
webblearning / Neural-Attention-Model-For-Abstractive-Sentence-Summarization
Tensorflow implementation of a Neural Attention Model for Abstractive Summarization.
☆10 · Jul 20, 2020 · Updated 6 years ago
mitll / graph-qube
Pattern-of-Behavior Search Tool
☆11 · Jun 20, 2022 · Updated 4 years ago
giscience-fsu / sperrorest
Spatial error estimation and variable importance
☆20 · Jan 30, 2025 · Updated last year
vatel / scala-akka-monitoring
Experiments with monitoring of Akka actors
☆20 · Sep 15, 2013 · Updated 12 years ago
ippontech / metrics-spark-reporter
Dropwizard Metrics reporter for Apache Spark
☆28 · Dec 22, 2014 · Updated 11 years ago
perdisci / jbirch
Automatically exported from code.google.com/p/jbirch
☆12 · Sep 6, 2022 · Updated 3 years ago
alchemyst / Segmentation
Timeseries segmentation library
☆12 · Mar 8, 2023 · Updated 3 years ago
XingyuGit / spark-birch
Spark Implementation of BIRCH Clustering algorithm
☆13 · Feb 18, 2020 · Updated 6 years ago
sailing-pmls / strads
Parallel ML System - STRADS scheduler
☆30 · Oct 4, 2018 · Updated 7 years ago
phatak-dev / java-sizeof
Memory consumption estimator for Scala/Java
☆27 · Nov 24, 2014 · Updated 11 years ago
udsclub / DataFestKyiv2017
All presentations from Data Fest Kyiv 2017 http://datafest.in.ua
☆13 · Apr 24, 2017 · Updated 9 years ago
alexander-n-thomas / pydata-vocab-analysis
This project is for the notebooks, code, and data for the "Vocabulary Analysis of Job Descriptions" tutorial at PyData 2017 Seattle
☆20 · Jul 12, 2017 · Updated 9 years ago
huangzworks / redis-cheatsheet
Redis Command Cheat Sheet
☆14 · Nov 6, 2017 · Updated 8 years ago
cloneofsimo / inversion_edits
☆21 · Feb 9, 2023 · Updated 3 years ago
lizhitao0923 / ansible-hadoop
Ansible playbooks to help to deploy Apache Hadoop, Spark, Storm, Zookeeper, Elasticsearch, Azkaban, Flume, Hbase, Kafka, Kibana, Logstash
☆10 · Mar 21, 2017 · Updated 9 years ago
wangyuchen / rserver-manual
Manual for RStudio Server
☆16 · Oct 5, 2013 · Updated 12 years ago
pythian / hive-json-split
Simple UDF to split JSON arrays into Hive arrays
☆10 · Jun 24, 2016 · Updated 10 years ago
Jackal08 / sa_risk_management
Group project for the WorldQuant University module, risk management.
☆13 · Feb 3, 2019 · Updated 7 years ago
titu1994 / Python-Work
Python scripts to facilitate easy working
☆11 · Mar 23, 2026 · Updated 3 months ago
cfregly / spark-after-dark
☆24 · Jul 2, 2015 · Updated 11 years ago
andrew-pete / Hockey-D3-Tutorial
An introduction to Hockey Visualization with D3.js
☆15 · Mar 27, 2018 · Updated 8 years ago
anilmuppalla / hpdc-scalding-spark
Code for Springer Book: High Performance Distributed Computing: Case Studies with Hadoop, Scalding and Spark
☆15 · Oct 6, 2017 · Updated 8 years ago
spider-123-eng / Hive-Pig-Hbase
Hive, Pig, Hbase, Sqoop examples
☆15 · Apr 24, 2017 · Updated 9 years ago
algolia / talksearch
🎤 An interactive search experience for video titles and transcripts
☆27 · Mar 3, 2023 · Updated 3 years ago
ravthiru / flink-cep-examples
Apache Flink CEP examples
☆12 · Jul 22, 2016 · Updated 10 years ago
vogt-m / ccbmlib
Modeling Tanimoto distributions for RDKit
☆18 · Feb 28, 2020 · Updated 6 years ago
toshi-k / kaggle-santander-customer-satisfaction
44th place solution in "Santander Customer Satisfaction"
☆11 · May 16, 2016 · Updated 10 years ago