xiaokugua250/DataMingProject

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xiaokugua250/DataMingProject)

xiaokugua250 / DataMingProject

大数据平台相关代码（ES/Hive/Hadoop/hdfs/hbase）

☆74

Alternatives and similar repositories for DataMingProject

Users that are interested in DataMingProject are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

singgel / BigData-skillTree
View on GitHub
【易车】- Spark、flink、HBase、Hive、flume集成了一些Hadoop的原生api的一些demo（如HDFS、MapReduce：目前就这两个）；同时测试一些异常功能
☆16Apr 4, 2019Updated 7 years ago
hongs01 / ltybdservice-root
View on GitHub
蓝泰源大数据基础平台
☆17Mar 7, 2018Updated 8 years ago
xinghalo / Teddy
View on GitHub
Spark Streaming监控平台，支持任务部署与告警、自启动
☆130Mar 29, 2018Updated 8 years ago
realxujiang / bigtable-sql
View on GitHub
分布式大数据SQL查询可视化界面！
☆67Sep 29, 2015Updated 10 years ago
oeljeklaus-you / UserActionAnalyzePlatform
View on GitHub
电商用户行为分析大数据平台
☆1,123Nov 16, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
monsonlee / BigData
View on GitHub
BigData Project 大数据项目由浅入深
☆644Nov 30, 2017Updated 8 years ago
elevy30 / bigdata-playground
View on GitHub
POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot)
☆12Dec 16, 2022Updated 3 years ago
winstonelei / BigDataTools
View on GitHub
tools for bigData
☆39Dec 19, 2018Updated 7 years ago
changge458 / userdraw
View on GitHub
用户画像代码，根据算法推算出用户的性别和年龄比率
☆11Dec 18, 2017Updated 8 years ago
zhenu14 / kettle
View on GitHub
Java调用Kettle API执行转换和作业，Java代码生成Kettle转换。
☆21Mar 9, 2018Updated 8 years ago
zengxiaosen / eshop
View on GitHub
电商+大数据+spark机器学习
☆17Dec 5, 2017Updated 8 years ago
lichaojacobs / kylin-jdbc-pool
View on GitHub
better performance for kylin query
☆15Jun 14, 2019Updated 7 years ago
garyudeng / Crawer
View on GitHub
各大电商网站数据抓取分析
☆31Sep 17, 2013Updated 12 years ago
344399160 / sparkForDB
View on GitHub
使用spark对hive、hbase、ES的读写，实现一次配置可对不同数据库进行导入导出，并对ES、hbase进行封装
☆32May 6, 2017Updated 9 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
hedy2009c / hdfs
View on GitHub
基于hadoop，利用ssh框架实现hdfs网盘
☆27Sep 5, 2012Updated 13 years ago
wyc941012 / MachineLearningPlatform
View on GitHub
基于Spark和Kubernetes的机器学习平台
☆31Mar 13, 2018Updated 8 years ago
TopSpoofer / hbrdd
View on GitHub
一个为spark批量导入数据到hbase的库
☆43Nov 18, 2016Updated 9 years ago
lttoto / SparkStreamingProject
View on GitHub
SparkStreaming项目，显示flume->Kafka->Spark->hbase(实时数据处理方案)，Scala实现
☆37Feb 19, 2018Updated 8 years ago
datamaning / MapReduce
View on GitHub
清华大数据作业MapReduce处理几百个G的JSON数据
☆50Jun 27, 2016Updated 10 years ago
SwordfallYeung / Interview_BigData
View on GitHub
关于大数据的面试题，包括hadoop、hbase、hive、spark、storm、zookeeper、kafka、flume、logstash、redis、ELK、ETL、算法等等，持续更新中
☆448Mar 31, 2019Updated 7 years ago
ligo / flowml
View on GitHub
流程化机器学习框架基于 scala java语言 ,一站式自动机器学习平台 ,主要包括数据分析特征工程，机器模型，自动部署，超参数优化，模型自动优化，自动扩容分配创建功能，类似第四范式、阿里PAI平台、google autoMl、亚马逊SageMaker
☆68Aug 1, 2018Updated 7 years ago
jiaoqiyuan / 163-bigdate-note
View on GitHub
bigdata note
☆40Jul 20, 2023Updated 3 years ago
horrificer / vagary
View on GitHub
数据库访问中间件，统一的标准sql查询，底层可以是不同的数据库包括mysql、ElasticSearch、kylin、presto等。
☆14Apr 21, 2018Updated 8 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
linjiayue / BigDataSalaryAnaliySystem
View on GitHub
大数据招聘信息分析平台
☆46Feb 25, 2016Updated 10 years ago
wemakebug / Knowledge
View on GitHub
基于知识图谱技术的搜素引擎研发
☆19Apr 24, 2017Updated 9 years ago
ruanjianlxm / panda-config
View on GitHub
基于zookeeper的分布式配置管理中心，在分布式系统中，配置文件经常多而繁杂，更新容易丢，有了这个组件，可以热更新，并且不会哪台机子上漏了哪个配置。
☆36Feb 15, 2016Updated 10 years ago
houshanren / big_data_architect_skills
View on GitHub
一个大数据架构师应该掌握的技能
☆476Sep 2, 2019Updated 6 years ago
seawaylee / spark-rec-v2
View on GitHub
Spark混合推荐系统大数据监控平台
☆11May 1, 2018Updated 8 years ago
sdksdk0 / Elasticsearch-Hbase
View on GitHub
elasticsearch+hbase海量数据查询,支持千万数据秒回查询
☆281Jan 1, 2017Updated 9 years ago
oeljeklaus-you / JavaOrBigData-Interview
View on GitHub
Java开发者或者大数据开发者面试知识点整理
☆253Feb 25, 2019Updated 7 years ago
Will-Grindelwald / Storm-Kafka
View on GitHub
Storm Kafka 流数据处理系统
☆20Oct 10, 2018Updated 7 years ago
nodicmyth / DW_ETL
View on GitHub
数据仓库KETTLE ETL资源库
☆14Jun 11, 2015Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kobelzy / WIFI-Analysis
View on GitHub
WiFi大数据分析项目
☆105Jun 9, 2018Updated 8 years ago
wengbenjue / spark_recomend
View on GitHub
使用Spark的MLlib、Hbase作为模型、Hive作数据清洗的核心推荐引擎,在Spark on Yarn测试通过
☆30Mar 9, 2017Updated 9 years ago
windwant / bigdata-service
View on GitHub
hadoop flume hbase kafka storm；读取kafka数据=》storm实时处理（分割字符，统计字符）=》写入hdfs
☆21Sep 21, 2018Updated 7 years ago
wanghan0501 / WiFiProbeAnalysis
View on GitHub
基于WIFI探针的商业大数据分析技术
☆303Nov 16, 2022Updated 3 years ago
dbaplus / DBAplus_Newsletter
View on GitHub
为广大技术爱好者提供数据库行业的最新技术发展趋势，为技术发展提供一个统一的发声平台。为此，我们策划了RDBMS、NoSQL、NewSQL、大数据、虚拟化、时间序列、国产数据库等几个版块的内容。
☆20Nov 7, 2017Updated 8 years ago
liushuishang / JobSchedule
View on GitHub
基于TBSchedule开发的一个分布式任务调度框架，可以解析任务间的依赖，并执行任务（执行Shell、bat脚本）
☆12Aug 5, 2016Updated 9 years ago
zwqjsj0404 / HBase-Research
View on GitHub
HBase数据库源代码学习研究(包括代码注释、文档、用于代码分析的测试用例)
☆10May 18, 2017Updated 9 years ago