KyloIO / kylo
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
☆21Updated 5 years ago
Related projects: ⓘ
- ☆29Updated this week
- an data-centric integration platform☆47Updated 3 years ago
- Guardian of Waterdrop and Spark☆30Updated last year
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Updated 7 years ago
- Kettle plugins for Apache Beam☆42Updated last year
- A web application for submitting spark application☆8Updated 3 years ago
- Ansible playbooks to help to deploy Apache Hadoop,Spark,Storm,Zookeeper,Elasticsearch,Azkaban,Flume,Hbase,Kafka,Kibana,Logstash☆10Updated 7 years ago
- Import data from clickhouse to hadoop with pure SQL☆36Updated 5 years ago
- phoenix☆12Updated last year
- Flink image for Kubernetes that fixes Jobmanage connection issue☆23Updated 6 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 7 months ago
- an open source dataworks platform☆21Updated 3 years ago
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆35Updated 7 years ago
- presto for Cloudera Manager parcel☆20Updated 8 years ago
- Apache Phoenix Query Server☆47Updated this week
- ☆52Updated this week
- ☆15Updated 2 years ago
- ☆11Updated this week
- 延云ydb千亿大数据实时解决方案☆31Updated 7 years ago
- A demo repository for "streaming etl" with Apache Flink☆43Updated 8 years ago
- Distributed SQL query engine for big data☆32Updated 5 years ago
- Capture changes of HBase to Kafka☆30Updated 8 years ago
- IoT Trucking App with Flink (with Table API & SQL)☆15Updated 6 years ago
- some useful User Defined Functions(UDF) for both PrestoSQL and TrinoDB☆18Updated last year
- Ambari service for Presto☆44Updated 4 years ago
- A library based on delta for Spark and MLSQL☆61Updated 3 years ago
- ☆14Updated 2 years ago
- 使用Hive读写solr☆31Updated 2 years ago
- DataTunnel 是一个基于spark引擎的超高性能的分布式数据集成软件,支持海量数据的同步。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。☆15Updated this week
- This is a datasource implementation for quick query in Kafka with Spark☆9Updated 11 months ago