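Predicts residents' travel destinations using the kNN and naive Bayes algorithms; written mainly in Scala and Python and run on a distributed Spark cluster.
☆10 · Jun 21, 2022 · Updated 3 years ago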
Alternatives and similar repositories for Destination-Prediction-
Users that are interested in Destination-Prediction- are comparing it to the libraries listed below
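- To understand the characteristics of users who watch popular movies, this project scrapes review data for popular movies from the Maoyan website and analyzes it: rating statistics, word clouds, review counts and average ratings by city, gender analysis, and the relationship between review count and time. ☆10 · Nov 14, 2019 · Updated 6 years ago
- CUDA code with exact k-NN algorithm for multiple GPU system. ☆12 · Jul 5, 2024 · Updated last year
- Bayesian classification algorithm in Python (dataset: Iris_data) ☆16 · May 11, 2018 · Updated 7 years ago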
- A distributed implementation of AdaBoost.MH and MP-Boost using Apache Spark ☆18 · Jul 7, 2016 · Updated 9 years ago
- Implement some ML algorithms in scala ☆21 · Jul 25, 2023 · Updated 2 years ago
- ☆35 · Mar 20, 2022 · Updated 3 years ago
- DBSCAN implementation using Apache Spark ☆48 · Feb 2, 2018 · Updated 8 years ago
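- Spark machine learning: uses Jupyter to explain how the algorithms work and to run related examples ☆107 · Dec 1, 2016 · Updated 9 years ago
- Simulates an e-commerce system that has been running in production for some time: the large volume of collected user behavior data is mined and analyzed in depth with big data technology (Flink) to derive business metrics of interest and strengthen risk control. The data falls into two broad categories: user behavior-habit data and business behavior data. User behavior-habit data includes the user's login method, session start times and durations, clicks and … ☆138 · May 5, 2020 · Updated 5 years ago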
- kmeans clustering with multi-GPU capabilities ☆123 · Apr 18, 2023 · Updated 2 years ago
- Spark learning path: study notes covering Spark Core, Spark SQL, Spark Streaming, and Spark mllib ☆145 · Jul 3, 2018 · Updated 7 years ago
- An implementation of DBSCAN running on top of Apache Spark ☆183 · Jan 10, 2018 · Updated 8 years ago
- The Nak Machine Learning Library ☆343 · Jul 18, 2017 · Updated 8 years ago
- Spark-2.3.1 source code walkthrough ☆201 · Dec 5, 2022 · Updated 3 years ago
- SparkMLlibDeepLearn deep learning ☆208 · Aug 3, 2015 · Updated 10 years ago
- k-Nearest Neighbors algorithm on Spark ☆240 · Nov 14, 2023 · Updated 2 years ago
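- If you work in big data BI and want to compare how different solutions such as MySQL, GreenPlum, Elasticsearch, Hive, Spark SQL, Presto, Impala, Drill, HAWQ, Druid, Pinot, Kylin, ClickHouse, and Kudu perform, … ☆285 · May 24, 2018 · Updated 7 years ago
- Offline and real-time recommendation system for Netflix movies, based on Apache Spark ☆248 · Mar 21, 2017 · Updated 8 years ago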
- Implement FedAvg algorithm based on Tensorflow ☆265 · Dec 6, 2020 · Updated 5 years ago
- Course: Introduction to Computer Systems ☆234 · Jan 9, 2019 · Updated 7 years ago
- Important labs from "Computer Systems: A Programmer's Perspective, Second Edition" ☆332 · Jul 28, 2016 · Updated 9 years ago
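- [Graduation project] Graduation-project system walkthroughs and debugging on offer, plus troubleshooting for Java SpringBoot projects and for big data deployment and runtime issues; contact via Xianyu 【闲鱼】https://m.tb.cn/h.ToqQJM5?tk=hM92egXVW4w CZ057 「I posted on Xianyu 【[hot][hot] IT materials, help finding Ja… ☆346 · Feb 6, 2025 · Updated last year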
- Everything you want about DP-Based Federated Learning, including Papers and Code. (Mechanism: Laplace or Gaussian, Dataset: femnist, shak… ☆421 · Oct 26, 2024 · Updated last year
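- [Big data essentials] Java and big data interview experiences shared by a career changer without a CS degree ☆465 · Jul 1, 2022 · Updated 3 years ago
- News site big data real-time analysis and visualization system project based on Spark2.x ☆536 · Mar 28, 2019 · Updated 6 years ago
- 💎🔥 Big data study notes ☆681 · May 13, 2019 · Updated 6 years ago
- The path to becoming a professional programmer. ☆2,790 · Dec 1, 2020 · Updated 5 years ago
- Regularly updated documentation for commonly used big data components in the Hadoop ecosystem. Focus, in order: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (the project includes Hadoop mind maps, Evernote notes, simple demos in Scala … ☆924 · Mar 23, 2023 · Updated 2 years ago
- Big data platform for e-commerce user behavior analysis ☆1,091 · Nov 16, 2022 · Updated 3 years ago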
- Learn CUDA Programming, published by Packt ☆1,231 · Dec 30, 2023 · Updated 2 years ago
- Apache Impala ☆1,267 · Updated this week
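- Reputation rankings of graduate advisors at BUPT (Beijing University of Posts and Telecommunications) ☆1,051 · Nov 20, 2017 · Updated 8 years ago
- Notes on the Chinese third edition of "Advanced Programming in the UNIX Environment" ☆1,391 · Jan 25, 2019 · Updated 7 years ago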
- Several simple examples for popular neural network toolkits calling custom CUDA operators. ☆1,528 · Apr 29, 2021 · Updated 4 years ago
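- [Big data interview questions] A collection of big data interview questions gathered from around the web, together with the author's own summarized answers. Currently covers interview knowledge for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks ☆1,650 · Aug 30, 2021 · Updated 4 years ago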
- Sample codes for my CUDA programming book ☆2,020 · Dec 14, 2025 · Updated 2 months ago
- 1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle. ☆1,775 · Sep 25, 2021 · Updated 4 years ago
- spark ml: analysis of the algorithm principles and of the concrete source-code implementations ☆1,962 · Mar 25, 2019 · Updated 6 years ago
- FedML - The Research and Production Integrated Federated Learning Library: https://fedml.ai ☆2,003 · Sep 3, 2022 · Updated 3 years ago