This project is a unified ETL platform that support various data processing technologies, including Spark, Hive, Hadoop, Python, Linux Shell script, etc.
☆17Oct 16, 2015Updated 10 years ago
Alternatives and similar repositories for BigETL
Users that are interested in BigETL are comparing it to the libraries listed below
Sorting:
- Few things we've met during our etl project based on spark☆24Mar 22, 2018Updated 7 years ago
- ☆34Feb 8, 2022Updated 4 years ago
- Extension and scripts to run analogue of sysbench OLTP test using pgbench☆12Oct 1, 2016Updated 9 years ago
- ☆10May 25, 2017Updated 8 years ago
- 基于TBSchedule开发的一个分布式任务调度框 架,可以解析任务间的依赖,并执行任务(执行Shell、bat脚本)☆12Aug 5, 2016Updated 9 years ago
- Interactive summary of Gartner's Magic Quadrant for Web Content Management with d3.js☆17Oct 15, 2012Updated 13 years ago
- MConn is a framework to build custom service-discovery-solutions on top Mesosphere's Marathon☆10Jul 27, 2015Updated 10 years ago
- 观点型问题阅读理解 challenger.ai☆10Nov 14, 2018Updated 7 years ago
- Template for a DuckDB-based, Codespace-oriented sandbox project that is also dbt Cloud compatible, and includes code-first BI tooling via…☆16Apr 7, 2023Updated 2 years ago
- https://github.com/uavorg/uavstack☆10Sep 11, 2017Updated 8 years ago
- Apache Spark based ETL Engine☆71Oct 18, 2016Updated 9 years ago
- A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.☆12Apr 1, 2017Updated 8 years ago
- Be a silentor,just focus on mark your words down!☆12Jul 18, 2015Updated 10 years ago
- ☆15Aug 25, 2014Updated 11 years ago
- Code for the Adzuna Salary Prediction Kaggle competition - http://www.kaggle.com/c/job-salary-prediction Placed 10th out of approximately…☆11Apr 10, 2013Updated 12 years ago
- 使用springboot+mybatis后台框架 前端bootstrap框架 添加web socket实时提醒订单消息 使用springsecurity进行权限拦截 邮箱+短信验证 ,echart图表显示用户订单信息,poi表报打印等等。。。☆12Oct 25, 2017Updated 8 years ago
- PostgreSQL bgworker for easy replication monitoring☆14Mar 13, 2026Updated last week
- This is an experimental exercise I'm using to develop a point of view on ingesting messages from IoT devices and persisting those message…☆12Apr 24, 2018Updated 7 years ago
- 批量处理小工具:①批量将word文档转换为pdf②给pdf文档批量添加水印☆12May 11, 2022Updated 3 years ago
- 用户画像代码,根据算法推算出用户的性别和年龄比率☆11Dec 18, 2017Updated 8 years ago
- Implementation of a Recommendation Engine for Reddit☆12Nov 19, 2014Updated 11 years ago
- Query PostgreSQL internals using SQL☆28May 27, 2019Updated 6 years ago
- Last-seen sketch implementation in Go☆16Dec 15, 2020Updated 5 years ago
- docker image to deploy rabbitmq cluster on mesos with one marathon app☆10Oct 12, 2017Updated 8 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Aug 1, 2016Updated 9 years ago
- echarts大屏展示得数据可视化☆11Sep 6, 2018Updated 7 years ago
- 基于sqoop封装的一个通用的抽取工具,方便数据平台界面提交任务以及数据源管理☆10May 2, 2017Updated 8 years ago
- Spark Streaming与OpenCV传感器数据实时获取☆13Jun 20, 2016Updated 9 years ago
- ☆14Apr 30, 2024Updated last year
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot)☆12Dec 16, 2022Updated 3 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Apr 25, 2017Updated 8 years ago
- hive sql parser☆11Aug 27, 2014Updated 11 years ago
- Mastering Mesos by Packt Publishing☆12Jan 30, 2023Updated 3 years ago
- Utilities for data cleaning and ETL processing☆24Dec 14, 2017Updated 8 years ago
- Utility to monitor AWS Redshift Performance☆12Jul 6, 2016Updated 9 years ago
- Set of ETL utils for Spark☆15May 4, 2020Updated 5 years ago
- KBQA☆14Mar 13, 2017Updated 9 years ago
- 离线调度, hive, 任务依赖, 任务调度, 大数据开发平台☆14May 10, 2018Updated 7 years ago
- BM25F demo with lucene using BlendedTermQuery and a custom similarity☆15Oct 11, 2016Updated 9 years ago