Mirror of Apache Apex malhar
☆134Nov 13, 2019Updated 6 years ago
Alternatives and similar repositories for apex-malhar
Users who are interested in apex-malhar are comparing it to the libraries listed below.
- Mirror of Apache Apex core☆350Jun 7, 2021Updated 4 years ago
- Mirror of Apache Apex site☆10Apr 29, 2025Updated 11 months ago
- Cascading on Apache Flink®☆54Feb 5, 2024Updated 2 years ago
- Llama - Low Latency Application MAster☆35Jun 27, 2022Updated 3 years ago
- Simple authentication for Zeppelin☆11Jul 16, 2015Updated 10 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Feb 1, 2016Updated 10 years ago
- Large RDF hierarchies as vector spaces☆20Jun 27, 2014Updated 11 years ago
- Set up tools for running a few DL libraries on CDH and CDSW☆17Jul 23, 2020Updated 5 years ago
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,545Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing☆16Jul 24, 2023Updated 2 years ago
- ☆110Apr 17, 2017Updated 8 years ago
- prototype kubernetes operator for couchDB☆17Sep 5, 2017Updated 8 years ago
- Efficient, distributed downloads of large files from S3 to HDFS using Spark.☆17Apr 26, 2017Updated 8 years ago
- Convert real-time bidding (RTB) models to the AppNexus Bonsai language☆15Oct 17, 2017Updated 8 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)☆88Jul 7, 2016Updated 9 years ago
- Enabling queries on compressed data.☆282Dec 16, 2023Updated 2 years ago
- Demo Ambari service to deploy/manage NiFi on HDP - Deprecated☆75Jul 24, 2018Updated 7 years ago
- WikiXMLJ provides easy access to Wikipedia XML dumps.☆21Jun 1, 2017Updated 8 years ago
- Scala library for accessing various file, batch systems, job schedulers and grid middlewares.☆29Updated this week
- A simple Wikipedia talk page parser☆11May 10, 2018Updated 7 years ago
- produce a stream of citation data coming off wikimedia☆12Mar 28, 2017Updated 9 years ago
- Mirror of Apache Myriad (Incubating)☆154Jul 11, 2023Updated 2 years ago
- A cookbook for installing and configuring Apache Spark☆11Sep 6, 2018Updated 7 years ago
- The scripts to benchmark Linkerd2 proxy using Fortio☆17Jan 23, 2019Updated 7 years ago
- ☆48Feb 4, 2018Updated 8 years ago
- Clojure template engine for generating HTML-based markup☆21Aug 24, 2015Updated 10 years ago
- Client for Mediawiki Api☆13Dec 30, 2025Updated 3 months ago
- Mirror of Apache Helix☆494Apr 4, 2026Updated last week
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆645Dec 17, 2023Updated 2 years ago
- EserKnife☆14May 11, 2018Updated 7 years ago
- ☆13Aug 15, 2014Updated 11 years ago
- Alenka JDBC is a library for accessing and manipulating data with the open-source GPU database Alenka.☆20Jul 3, 2014Updated 11 years ago
- A new object-graph-wrapper for the Tinkerpop 3 graph stack.☆41Mar 31, 2021Updated 5 years ago
- Mirror of Apache Bahir☆336Jul 7, 2023Updated 2 years ago
- Cucumber-based framework for defining and executing SQL unit, integration and acceptance tests (for AWS Redshift, PostgreSQL)☆13Sep 30, 2020Updated 5 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆1,113Jan 12, 2023Updated 3 years ago
- An example PySpark project with pytest☆18Oct 13, 2017Updated 8 years ago
- Monad transformers for exception handling☆17Aug 19, 2024Updated last year
- Distributed Temporal Graph Analytics with Apache Flink☆251Jan 11, 2026Updated 2 months ago