使用spark对hive、hbase、ES的读写, 实现一次配置可对不同数据库进行导入导出,并对ES、hbase进行封装
☆32May 6, 2017Updated 9 years ago
Alternatives and similar repositories for sparkForDB
Users that are interested in sparkForDB are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 此工程采用SpringBoot + Mybatis + SparkSQL + Hive框架进行集成,支持Kerberos认证。☆21Mar 19, 2018Updated 8 years ago
- 基于springbook+spark的机器学习应用开发☆12Nov 21, 2022Updated 3 years ago
- 基于redis的分布式锁,适用于秒杀,自增ID等web分布式开发场景☆11Mar 21, 2017Updated 9 years ago
- 针对一维时间序列数据,采用GMM和K-Means算法进行异常点检测。For one-dimensional time series data, GMM and K-means algorithm are used to detect outliers.☆11Jan 16, 2021Updated 5 years ago
- elasticsearch-jdbc,在elasticsearch-sql的jdbc实验特性基础上完成,可使用sql和rest api的方式执行elasticsearch操作☆18Mar 8, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 一个基于ElasticSearch的业务日志记录工具☆10Nov 5, 2018Updated 7 years ago
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot)☆12Dec 16, 2022Updated 3 years ago
- 专注大数据 Spark ML 机器学习:监督学习、无监督学习,主要有:分类算法、回归算法、聚类算法、推荐算法、频繁模式挖掘算法☆17Nov 6, 2020Updated 5 years ago
- SpringBoot + Apache Mahout 推荐引擎 基于用户评分数据推荐相关电影☆11Jun 7, 2018Updated 7 years ago
- SparkStreaming中利用MySQL保存Kafka偏移 量保证0数据丢失☆43Aug 2, 2017Updated 8 years ago
- 【易车】- Spark、flink、HBase、Hive、flume集成了一些Hadoop的原生api的一些demo(如HDFS、MapReduce:目前就这两个);同时测试一些异常功能☆16Apr 4, 2019Updated 7 years ago
- 基于canal.deployer-1.1.1-SNAPSHOT.tar,canal连接kafka,springboot消费kafka数据入hbase和ElasticSearch☆15Dec 19, 2018Updated 7 years ago
- 大数据平台相关代码(ES/Hive/Hadoop/hdfs/hbase)☆74Nov 4, 2022Updated 3 years ago
- ☆13Aug 13, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 微信小程序之大数据共享单车项目☆25Jun 21, 2022Updated 3 years ago
- 视频教育网站☆17Sep 25, 2018Updated 7 years ago
- springboot+MySQL+mybatis☆14Aug 26, 2024Updated last year
- 拉比克是一个开源大数据平台构建方案,已稳定应用于生产集群。融合Hadoop、Hive、Hbase、zookeeper等如CDH☆14Mar 11, 2019Updated 7 years ago
- 在规格文件上直接执行SQL,无数据库依赖,基于Java8的流计算和Lamdba表达式。☆10Jun 15, 2017Updated 8 years ago
- 分布式锁的几种实现方法:redis实现分布式锁☆12Dec 5, 2016Updated 9 years ago
- Spring Cloud微服务架构教程By Gary。每个目录对应着教程里每个组件的项目demo,运行起来即是完整的微服务集群。史上最豪华全家桶套餐,星宿老仙法力无边~☆11Jan 8, 2019Updated 7 years ago
- a syslog server&client which is used to receive/convert/send the logs.☆11Dec 14, 2022Updated 3 years ago
- Pinot 是一个实时分布式的 OLAP 数据存储和分析系统。LinkedIn 使用它实现低延迟可伸缩的实时分析。Pinot 从离线数据源(包括 Hadoop 和各类文件)和在线数据源(如 Kafka)中攫取数据进行分析。Pinot 被设计是可以进行水平扩展的☆16Nov 8, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 在公司接了一个任务,完成一个项目数据同步的模块。要求是不能操作项目的数据库。怕操作不当,数据丢失。所以想到的方案是使用log4jdbc记录数据源的SQL语句到日志文件。然后按行读取日志文件中的数据,记录读取的Point,以便下次继续读取。读取的数据进入bigqueue队列,…☆12Aug 10, 2017Updated 8 years ago
- Spark(multi versions) + Streaming/Hive/SQL/UDF Demos☆15May 17, 2018Updated 8 years ago
- better performance for kylin query☆15Jun 14, 2019Updated 6 years ago
- java爬虫,反爬虫策略、ETL清洗数据,以及spark离线和实时分析新闻并存入ES☆19Nov 26, 2018Updated 7 years ago
- SpringBoot+Mybatis+MySQL☆17Dec 8, 2016Updated 9 years ago
- spring+spark streaming+kafka 10版本集成和异常问题处理☆17Jul 21, 2017Updated 8 years ago
- HBase操作封装的orm: easy-hbase,更方便的使用HBase☆19Jan 16, 2024Updated 2 years ago
- sql 解析引擎 探索☆16Dec 29, 2017Updated 8 years ago
- 🔍使用elasticsearch的java api进行from&size和scroll分页操作。☆10Jun 17, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Streaming 相关项目☆15Mar 27, 2017Updated 9 years ago
- 监控维度:1.监控内容信息采集;2.监控对象Url,Spring,数据源,异常,jvm,服务信息;3.监控策略处理☆12Nov 2, 2018Updated 7 years ago
- Structured Streaming is a reference application showing how to easily integrate structured streaming Apache Spark Structured Streaming, …☆13Nov 17, 2018Updated 7 years ago
- mongodb的使用☆13Dec 27, 2017Updated 8 years ago
- elasticsearch+hbase海量数据查询,支持千万数据秒回查询☆280Jan 1, 2017Updated 9 years ago
- zookeeper java客户端的封装,使用者不用再关心断线重连、watch监听、session过期等问题,并使用zookeeper 实现了分布式锁☆32Feb 9, 2018Updated 8 years ago
- Zdal是支付宝自主研发的数据中间件产品,采用标准的JDBC规范,可以在分布式环境下看上去像传统数据库一样提供海量数据服务,是一种通用的分库分表数据库访问框架,解决单库单表数据库访问压力,Zdal主要提供分库分表,结果集合并,sql解析,数据库failover动态切换等功能…☆18Dec 17, 2018Updated 7 years ago