This project is a unified ETL platform that support various data processing technologies, including Spark, Hive, Hadoop, Python, Linux Shell script, etc.
☆17Oct 16, 2015Updated 10 years ago
Alternatives and similar repositories for BigETL
Users that are interested in BigETL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Few things we've met during our etl project based on spark☆24Mar 22, 2018Updated 8 years ago
- MonQ - PostgreSQL extension for MongoDB-like queries to jsonb data☆17Jul 12, 2017Updated 8 years ago
- Interactive summary of Gartner's Magic Quadrant for Web Content Management with d3.js☆17Oct 15, 2012Updated 13 years ago
- Drools processor for Apache NiFi☆39Oct 23, 2019Updated 6 years ago
- MConn is a framework to build custom service-discovery-solutions on top Mesosphere's Marathon☆10Jul 27, 2015Updated 10 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Apache Spark based ETL Engine☆71Oct 18, 2016Updated 9 years ago
- Code of the book "Getting started with the Julia Programming Language"☆11Jul 7, 2018Updated 7 years ago
- ☆15Aug 25, 2014Updated 11 years ago
- Be a silentor,just focus on mark your words down!☆12Jul 18, 2015Updated 10 years ago
- Go based utilities for working with Apache Mesos Frameworks☆12Apr 22, 2016Updated 10 years ago
- 使用shell脚本部署Apache Doris (incubating) FE & BE☆11Jul 8, 2019Updated 6 years ago
- 批量处理小工具:①批量将word文档转换为pdf②给pdf文档批量添加水印☆12May 11, 2022Updated 3 years ago
- 用户画像代码,根据算法推算出用户的性别和年龄比率☆11Dec 18, 2017Updated 8 years ago
- Implementation of a Recommendation Engine for Reddit☆12Nov 19, 2014Updated 11 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Query PostgreSQL internals using SQL☆28May 27, 2019Updated 6 years ago
- 在线编辑pdf文档☆11Jun 21, 2022Updated 3 years ago
- 数据中后台高阶组件☆12Apr 10, 2026Updated 3 weeks ago
- Last-seen sketch implementation in Go☆16Dec 15, 2020Updated 5 years ago
- docker image to deploy rabbitmq cluster on mesos with one marathon app☆10Oct 12, 2017Updated 8 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Aug 1, 2016Updated 9 years ago
- disruptor-learn☆13Dec 24, 2018Updated 7 years ago
- 有向无环图在大文本匹配N多关键字中的应用☆11Jul 21, 2018Updated 7 years ago
- 华为软件精英挑战赛2019,实时计算全图路况,每辆车在每个时刻(或隔几个时刻)根据自身信息生成自己的权重矩阵,利用SPFA算法动态规划路径☆10Apr 14, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyVidarDB is a simple, fast, and persistent key-value store that can store terabytes of data. It is the Python binding for VidarDB.☆21Jul 18, 2022Updated 3 years ago
- echarts大屏展示得数据可视化☆11Sep 6, 2018Updated 7 years ago
- Spark Streaming与OpenCV传感器数据实时获取☆13Jun 20, 2016Updated 9 years ago
- 基于sqoop封装的一个通用的抽取工具,方便数据平台界面提交任务以及数据源管理☆10May 2, 2017Updated 9 years ago
- Scribe Apache module for logging☆25Nov 16, 2009Updated 16 years ago
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot)☆12Dec 16, 2022Updated 3 years ago
- 基于springbook+spark的机器学习应用开发☆12Nov 21, 2022Updated 3 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Jun 8, 2016Updated 9 years ago
- Mastering Mesos by Packt Publishing☆12Jan 30, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Utilities for data cleaning and ETL processing☆24Dec 14, 2017Updated 8 years ago
- Utility to monitor AWS Redshift Performance☆12Jul 6, 2016Updated 9 years ago
- Set of ETL utils for Spark☆15May 4, 2020Updated 6 years ago
- KBQA☆14Mar 13, 2017Updated 9 years ago
- wac☆27Dec 7, 2022Updated 3 years ago
- 离线调度, hive, 任务依赖, 任务调度, 大数据开发平台☆14May 10, 2018Updated 7 years ago
- Rust implementation of the Filecoin protocol☆12Aug 25, 2020Updated 5 years ago