Some popular algorithms(dbscan,knn,fm etc.) on spark
☆32May 29, 2018Updated 8 years ago
Alternatives and similar repositories for AlgorithmsOnSpark
Users that are interested in AlgorithmsOnSpark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in …☆19Jun 22, 2021Updated 4 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- An implementation of DBSCAN runing on top of Apache Spark☆182Jan 10, 2018Updated 8 years ago
- An analysis on Aadhaar dataset using Mapreduce and Spark☆14Feb 28, 2018Updated 8 years ago
- doddle-model code examples☆19Sep 23, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- DBSCAN clustering algorithm implemented in Apache Spark (MapReduce Framework).☆13May 5, 2016Updated 10 years ago
- Using k-d trees with Apache Spark and Scala☆11Jul 3, 2015Updated 10 years ago
- ☆20Feb 28, 2018Updated 8 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Mar 15, 2017Updated 9 years ago
- Invoke Pandas plotting by piping in SQL output via PSQL (Can be used with Postgres or Greenplum or any SQL engine).☆16Nov 8, 2014Updated 11 years ago
- An example project using Spark Streaming with Kafka message and Avro serialization☆12Aug 21, 2015Updated 10 years ago
- Spark On Angel, arming Spark with a powerful Parameter Server, which enable Spark to train very big models☆85Jan 2, 2023Updated 3 years ago
- Docker image for Dataiku Science Studio☆10Apr 20, 2017Updated 9 years ago
- ☆14Nov 3, 2016Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Factorization Machines on Spark and Glint☆25Nov 7, 2016Updated 9 years ago
- High performance HBase / Spark SQL engine☆28Jul 7, 2022Updated 3 years ago
- spark性能调优总结 spark config and tuning☆118Mar 9, 2018Updated 8 years ago
- Exploration of Convolutional Neural Networks using DeepLearning4J and Scala for Kaggle competition on Yelp Photo Classification☆12Nov 3, 2016Updated 9 years ago
- Problems can be found over - https://www.hackerrank.com/domains/shell/bash/☆13Jan 20, 2015Updated 11 years ago
- Affinity Propagation on Spark☆20May 31, 2021Updated 5 years ago
- XGBoost on Spark for Chinese Text Classification☆46May 31, 2018Updated 8 years ago
- Rossmann Store Sales: https://www.kaggle.com/c/rossmann-store-sales☆10May 13, 2018Updated 8 years ago
- 主要解决ctr预估工程中的特征选择,特征编号(特征离散),单特征auc和logloss这3个问题.☆20Mar 30, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Working example of consuming Avro data from Kafka with Spark Streaming☆12Feb 21, 2016Updated 10 years ago
- Subset Met Office MOGREPS-UK and UKV on AWS EC2☆12Oct 22, 2021Updated 4 years ago
- ☆24Mar 11, 2016Updated 10 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz☆18Jul 3, 2017Updated 8 years ago
- ☆11May 8, 2020Updated 6 years ago
- ☆13Oct 16, 2020Updated 5 years ago
- ☆19Jun 27, 2025Updated 11 months ago
- Contain Interview Questions Solutions☆12May 18, 2018Updated 8 years ago
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks☆18Nov 9, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So …☆37Apr 19, 2017Updated 9 years ago
- CAIL-CCL-2019相似案例匹配三等奖解决方案☆14Oct 28, 2019Updated 6 years ago
- The Nak Machine Learning Library☆342Jul 18, 2017Updated 8 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 9 years ago
- ☆36Mar 17, 2026Updated 2 months ago
- network embedding and recommendation system☆10Jul 2, 2019Updated 6 years ago
- Featureselection methods as Spark MLlib Pipelines☆31Apr 29, 2018Updated 8 years ago