Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm
☆106Jan 22, 2024Updated 2 years ago
Alternatives and similar repositories for chombo
Users that are interested in chombo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- Apache Spark based ETL Engine☆71Oct 18, 2016Updated 9 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30Apr 15, 2026Updated 3 weeks ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- A pyspark lib to validate data quality☆19Nov 11, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆21Oct 1, 2015Updated 10 years ago
- ☆10Apr 10, 2014Updated 12 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆25Oct 16, 2020Updated 5 years ago
- ☆25Oct 12, 2016Updated 9 years ago
- flinksql-platform☆19Mar 22, 2021Updated 5 years ago
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago
- Repository of Notebooks taken from https://neo4j.com/graph-algorithms-book/☆26Feb 21, 2020Updated 6 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Oct 20, 2020Updated 5 years ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.☆76Apr 24, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- An example of how to use the JDBC to issue Hive queries from a Java client application.☆11Apr 5, 2018Updated 8 years ago
- ☆32Mar 21, 2018Updated 8 years ago
- Go Plug-ins & Vendored Dependencies: A Solution☆11Jun 1, 2017Updated 8 years ago
- Example API Access SmartApp that shows the state and allows control of devices☆12Mar 11, 2026Updated last month
- Build configuration-driven ETL pipelines on Apache Spark☆162Oct 4, 2022Updated 3 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆125Apr 23, 2026Updated last week
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆63Sep 6, 2024Updated last year
- Distributed optimization framework with parameter server☆23Jun 14, 2015Updated 10 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 5 years ago
- Distributed SQL query engine for running interactive analytic queries against big data sources.☆10Jul 1, 2016Updated 9 years ago
- 优化flink的多流操作(例如join),优化点不限于数据丢失问题,以及性能问题☆11Apr 8, 2019Updated 7 years ago
- A containerized development environment for Go, includes automatic code reloads, test running, and vendored dependencies. Powered by Dock…☆12Feb 21, 2017Updated 9 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- Implementation of a Big Data (batch and stream) distributed processing engine in Java using Akka actors.☆12Feb 20, 2023Updated 3 years ago
- A fork of cascading patterns, but implemented for trident☆72Dec 16, 2023Updated 2 years ago
- Scala API for distributed closures on Apache Ignite☆11Jun 6, 2015Updated 10 years ago
- 使用spark + kudu的案例☆15Sep 13, 2017Updated 8 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Drools processor for Apache NiFi☆39Oct 23, 2019Updated 6 years ago
- Data ingestion examples☆11Feb 12, 2015Updated 11 years ago
- Java event logs collector for hadoop and frameworks☆42Mar 25, 2025Updated last year
- THIS REPOSITORY IS DEPRECATED☆19Jul 6, 2023Updated 2 years ago
- [Desperate] 饿了么-大数据部门常用 UI 组件库☆14May 18, 2018Updated 7 years ago
- Cloud based Data Platform based on Apache Spark☆28Apr 24, 2026Updated last week
- An Ansible collection of utilities and other resources for Cloudera Platform deployments☆13Updated this week