Python implementation of k-means clustering algorithm in MapReduce.
☆18Apr 7, 2019Updated 7 years ago
Alternatives and similar repositories for hadoop-kmeans
Users that are interested in hadoop-kmeans are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mastering Hadoop 3, published by Packt☆20Jan 30, 2023Updated 3 years ago
- 创造自己的工具集,build for fun🎉☆17May 13, 2023Updated 3 years ago
- go and blockchain study note,欢迎各位志同道合的朋友一起完善,让更多的go或者区块链开发者能够有一份不错的学习资料☆13Oct 5, 2018Updated 7 years ago
- Sanskrit compound segmentation using seq2seq model☆26Sep 29, 2018Updated 7 years ago
- Code repository for Java Data Science Cookbook, published by Packt☆25Jan 30, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Dijkstra Algorithm - Python Hadoop Streaming and Pyspark☆23Jun 6, 2018Updated 8 years ago
- MapReduce by examples☆100Apr 16, 2019Updated 7 years ago
- Notes talking about the design and implementation of Apache Spark☆19Dec 4, 2020Updated 5 years ago
- 实现视频字幕和横竖屏切换☆30Jun 15, 2019Updated 7 years ago
- 网站点击流离线日志分析☆19Sep 13, 2018Updated 7 years ago
- use tcp implement, not http server, base on golang☆29Mar 14, 2026Updated 3 months ago
- This project implements different Deep Autoencoder for Collaborative Filtering for Recommendation Systems in Keras☆53Nov 28, 2019Updated 6 years ago
- Keras implementation of AutoRec and DeepRecommender from Nvidia.☆62Dec 31, 2019Updated 6 years ago
- 基于spark streaming+flume+kafka+hbase的实时日志处理分析系统(分为控制台版本和基于springboot、Echarts等的Web UI可视化版本)☆41Jul 20, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 基于Hadoop的Web日志分析,包括日志的清洗、日志的统计分析、统计结果的导出、指标数据的Web展示☆41Feb 9, 2022Updated 4 years ago
- ☆51Mar 12, 2020Updated 6 years ago
- This is a package in Python which implements a tokenizer, stemmer for Hindi language☆94Oct 2, 2020Updated 5 years ago
- Using Deep Autoencoders for predictions of movie ratings.☆113May 9, 2023Updated 3 years ago
- 开始Scrapy实战如:存数据库、下载文件、爬京东、淘宝、Anti-Anti-Spider……☆424Apr 22, 2018Updated 8 years ago
- 基于spark的外卖大数据平台分析系统☆47Dec 16, 2018Updated 7 years ago
- A full featured walkthough of using django on zappa (powered by AWS Lambda in serverless environment)☆183May 15, 2023Updated 3 years ago
- ☆141Jan 28, 2021Updated 5 years ago
- Data for the quantitative study of (Vedic) Sanskrit☆157Mar 5, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- xfspell — the Transformer Spell Checker☆189Jun 18, 2020Updated 6 years ago
- ShuffleNet-V2 for both PyTorch and Caffe.☆504Aug 9, 2018Updated 7 years ago
- Built a simple chatbot from a sequence-to-sequence model with TensorFlow.☆147Mar 7, 2019Updated 7 years ago
- ✏️[计算机基础+java基础+大数据基础及进阶+面试指南] 一份涵盖计算机基础,java,大数据,面试宝典,大部分核心知识的项目,学习,面试,共同进步!☆90Jul 6, 2023Updated 2 years ago
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆149Apr 4, 2025Updated last year
- 大数据组件 All-in-One 的 Dockerfile☆99Nov 19, 2024Updated last year
- 🏖 Easy training and deployment of seq2seq models.☆227Mar 26, 2021Updated 5 years ago
- 🕷一些Scrapy爬虫的练手项目☆76Apr 30, 2019Updated 7 years ago
- Java网上图书商城,项目基于MVC设计模式,采用B/S结构☆71Feb 19, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone☆131Oct 10, 2023Updated 2 years ago
- Golang的ebiten引擎做的 传奇小demo☆108Feb 3, 2023Updated 3 years ago
- 📦 原创 开发的 爬虫实用工具 【特定代理池】【特定cookies池】【注册辅助工具】☆118Oct 4, 2019Updated 6 years ago
- Jupyter notebooks for pyspark tutorials given at University☆111Jan 7, 2026Updated 5 months ago
- Modern C++ Programming Cookbook, Second Edition, published by Packt☆216Aug 7, 2022Updated 3 years ago
- 用go刷leetcode,已更新2000+常见面试算法题目,提供多种解题思路☆129Jul 8, 2025Updated 11 months ago
- PyTorch implementations of Top-N recommendation, collaborative filtering recommenders.☆145Jul 20, 2021Updated 4 years ago