dw etl 工具 mysql 增量、全量抽取 to hive. 合并 hive 数据表, 等数据平台清洗工具
☆10Dec 21, 2016Updated 9 years ago
Alternatives and similar repositories for dw_etl
Users that are interested in dw_etl are comparing it to the libraries listed below
Sorting:
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Nov 29, 2016Updated 9 years ago
- 新闻聚合与订阅系统后端☆15May 9, 2019Updated 6 years ago
- java爬虫,反爬虫策略、ETL清洗数据,以及spark离线和实时分析新闻并存入ES☆19Nov 26, 2018Updated 7 years ago
- 一个优秀的大数据查询平台,提供hive异步任务查询、LDAP用户、数据权限控制、历史查询任务与结果存储、邮件通知、excel下载等功能。☆24Dec 30, 2017Updated 8 years ago
- FPtree algorithm to mining frequent pattern☆20Aug 6, 2013Updated 12 years ago
- A tool for translating Scala source code into readable and maintainable Java code☆13Jan 3, 2026Updated 2 months ago
- 【合并到至轻云】☆25Jun 19, 2025Updated 8 months ago
- JPMML 加载 PMML 模型进行 predict☆29Aug 10, 2020Updated 5 years ago
- 让数据分析师可以有比Excel更好的使用体验 a spreadsheet component to make data analysis easier☆10Jan 3, 2020Updated 6 years ago
- Spark Sql进行离线日志分析,Java Web+Echarts+Ajax进行数据可视化展示☆27Sep 10, 2018Updated 7 years ago
- spring-boot利用scala写spark程序骨架☆28Oct 22, 2019Updated 6 years ago
- A batch-processing system base on Spring Boot and Spring Batch. 一个基于SpringBoot和SpringBatch的批处理系统。☆10Sep 10, 2018Updated 7 years ago
- Apache Spark’s classic Word Count example with Spring Boot.☆11Apr 21, 2023Updated 2 years ago
- Apache Geode on Kubernetes☆10Oct 19, 2019Updated 6 years ago
- A distributed data factory, providing data access, etl, scheduling. Easily manage tasks such as hive, spark, clickhouse, flink, shell, py…☆33May 21, 2022Updated 3 years ago
- A solution for UpLoad data(TXT,ScreenShot) to server,Contact with PHP ( 数据上传解决方案,比如上传log信息,上传屏幕截图,PHP后端交互存储文件)☆11Aug 22, 2018Updated 7 years ago
- ☆12Jan 5, 2019Updated 7 years ago
- zdh系列-基于java的经营风控引擎☆13Jan 24, 2026Updated last month
- This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin.☆10Mar 28, 2019Updated 6 years ago
- 这是居于 derby 源代码,通过删减的方式,从里面抽取出sql解析功能。并在此基础上开发出跨库连接查询器。通过该工具可以将连接查询分割成多个单表查询,再将单表结果集进行连接,即将数据库的连接功能上移到工具执行。详情可以查看wiki:readme☆10Feb 14, 2017Updated 9 years ago
- Vimeo Google Analytics & Google Tag Manager Embed Tracking Edit Add topics☆10Sep 29, 2017Updated 8 years ago
- ☆11Sep 1, 2022Updated 3 years ago
- ☆10Mar 29, 2022Updated 3 years ago
- Ambari Custom Service to deploy MongoDb in a cluster however you want: as a sharding cluster; as a replicaset or standalone☆15Mar 4, 2018Updated 8 years ago
- ChatBI UI☆10May 26, 2023Updated 2 years ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- ServiceFramework 示例项目☆10Apr 2, 2016Updated 9 years ago
- Run Deekseek LLM model locally with Ollama, deepseek-r1:1.5b, and React☆11Jan 29, 2025Updated last year
- 包含猫眼电影、豆瓣、b站、微博、天气预报、Metacritic、Pokemon图鉴等,爬取信息并保存在对应的文件中。☆10Jan 1, 2025Updated last year
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- ansible plugins used by xiaomi☆10Oct 13, 2018Updated 7 years ago
- Something Temporary☆10Oct 18, 2018Updated 7 years ago
- Creates a Lucene index out of files from a local folder☆13Aug 8, 2014Updated 11 years ago
- LightRAG with Neo4j Example Project☆17May 19, 2025Updated 9 months ago
- 利用公开的安然财务和邮件数据集,利用 PCA 和特征选择分析处理缺失的数据,再通过朴素贝叶斯、决策树、SVM等机器学习构建筛选器,找出有欺诈嫌疑的安然员工☆10Dec 7, 2017Updated 8 years ago