Python implementation of k-means clustering algorithm in MapReduce.
☆18Apr 7, 2019Updated 6 years ago
Alternatives and similar repositories for hadoop-kmeans
Users that are interested in hadoop-kmeans are comparing it to the libraries listed below
Sorting:
- ☆10Oct 17, 2019Updated 6 years ago
- Apriori algorithm implementation☆14Feb 2, 2018Updated 8 years ago
- ✨ Drop-in replacement to net.Conn with pooling and auto-reconnect☆18Dec 1, 2025Updated 3 months ago
- With Red Hen Lab’s Rapid Annotator we try to enable researchers worldwide to annotate large chunks of data in a very short period of time…☆15May 14, 2024Updated last year
- Review the state of art works of handwritten signature verification works☆20Jan 22, 2019Updated 7 years ago
- 一个交通大数据可视化系统☆27Sep 1, 2017Updated 8 years ago
- Dijkstra Algorithm - Python Hadoop Streaming and Pyspark☆23Jun 6, 2018Updated 7 years ago
- Notes talking about the design and implementation of Apache Spark☆19Dec 4, 2020Updated 5 years ago
- the note of hbase☆25Jul 1, 2022Updated 3 years ago
- 一个基于微信公众号的智能聊天机器人项目,支持根据关键字或者调用OpenAI、通义千问、豆包等大语言模型服务回复内容☆31Jan 14, 2025Updated last year
- Nav-manage 为静态导航带来强大的管理扩展☆29May 5, 2025Updated 10 months ago
- use tcp implement, not http server, base on golang☆28Jun 13, 2024Updated last year
- 可在Linux下使用的天翼电信校园网客户端 - Node.js版☆23May 14, 2018Updated 7 years ago
- MapReduce by examples☆100Apr 16, 2019Updated 6 years ago
- 实现视频字幕和横竖屏切换☆30Jun 15, 2019Updated 6 years ago
- 一键傻瓜式部署centos服务器环境shell脚本☆31Dec 30, 2025Updated 2 months ago
- ☆25Jan 13, 2021Updated 5 years ago
- GO代码实现的最小区块链Demo (一共200行代码不到)☆37May 12, 2018Updated 7 years ago
- 基于spark streaming+flume+kafka+hbase的实时日志处理分析系统(分为控制台版本和基于springboot、Echarts等的Web UI可视化版本)☆39Jul 20, 2023Updated 2 years ago
- 基于Hadoop的Web日志分析,包括日志的清洗、日志的统计分析、统计结果的导出、指标数据的Web展示☆43Feb 9, 2022Updated 4 years ago
- A tool for handwritten text (straight and skewed) line segmentation based on a statistical approach.☆40Jun 29, 2018Updated 7 years ago
- 基于 Python 实现登录和登出广东天翼校园网的命令行工具☆50Jul 3, 2025Updated 8 months ago
- 各大视频站URL播放地址解析☆55Nov 29, 2018Updated 7 years ago
- Big-Interleaved-Dataset☆58Jan 21, 2023Updated 3 years ago
- 基于movielens数据集的电影推荐系统☆48Nov 22, 2022Updated 3 years ago
- 😮python模拟登陆一些大型网站,还有一些简单的爬虫,希望对你们有所帮助❤️,如果喜欢记得给个star哦🌟☆34Oct 1, 2020Updated 5 years ago
- ☆51Mar 12, 2020Updated 5 years ago
- 一个实时数仓项目,从0到1搭建实时数仓☆64May 27, 2021Updated 4 years ago
- 基于Spark2.2新闻网大数据实时系统项目☆62Apr 3, 2019Updated 6 years ago
- 基于Django和Hadoop集群进行的大数据分析平台☆70Nov 25, 2017Updated 8 years ago
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆76Feb 27, 2023Updated 3 years ago
- 爬虫项目☆70Oct 14, 2018Updated 7 years ago
- Java网上图书商城,项目基于MVC设计模式,采用B/S结构☆72Feb 19, 2022Updated 4 years ago
- 爬取B站up视频详细信息,并进行可 视化☆102May 23, 2024Updated last year
- 基于协同过滤和spark-als的电影推荐系统☆92Nov 22, 2022Updated 3 years ago
- Jupyter notebooks for pyspark tutorials given at University☆110Jan 7, 2026Updated 2 months ago
- 此推荐系统类似网易云音乐推荐歌单以及推荐相似歌曲☆95May 2, 2018Updated 7 years ago
- 开始Scrapy实战如:存数据库、下载文件、爬京东、淘宝、Anti-Anti-Spider……☆424Apr 22, 2018Updated 7 years ago
- ☆88Mar 5, 2023Updated 3 years ago