"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
☆270Jul 12, 2023Updated 2 years ago
Alternatives and similar repositories for styx
Users that are interested in styx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ephemeral Hadoop clusters using Google Compute Platform☆135Mar 31, 2022Updated 3 years ago
- Runs JVM closures in Docker containers on Kubernetes☆36Mar 23, 2018Updated 7 years ago
- A lightweight workflow definition library☆155Jul 15, 2022Updated 3 years ago
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,620Updated this week
- Java/Scala library for easily authoring Flyte tasks and workflows☆44Jan 13, 2026Updated 2 months ago
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code☆297Jan 31, 2025Updated last year
- GCS support for avro-tools, parquet-tools and protobuf☆79May 5, 2025Updated 10 months ago
- Minimal value types for Java☆78Feb 27, 2026Updated 3 weeks ago
- Control Plane for Flyte. Flyteadmin is a gRPC + REST Service written in golang and uses a RDBMs to store meta information and management …☆39Oct 9, 2023Updated 2 years ago
- DBeam exports SQL tables into Avro files using JDBC and Apache Beam☆194Oct 28, 2025Updated 4 months ago
- Flyte Flink k8s plugin.☆20Jan 29, 2025Updated last year
- Scio IDEA plugin☆30Oct 2, 2025Updated 5 months ago
- Java library for working with Guava futures☆140Feb 7, 2024Updated 2 years ago
- A Scala feature transformation library for data science and machine learning☆473Feb 7, 2025Updated last year
- Flyte Backend Plugins contributed by the Flyte community.☆29Oct 9, 2023Updated 2 years ago
- ☆23Jan 3, 2025Updated last year
- Building Scio from scratch step by step☆20May 20, 2019Updated 6 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Oct 9, 2023Updated 2 years ago
- ☆54Aug 3, 2017Updated 8 years ago
- The Heroic Time Series Database☆846Mar 26, 2021Updated 4 years ago
- A collection of Magnolia add-on modules☆182Feb 12, 2026Updated last month
- An asynchronous memcache client for Java☆154Dec 17, 2024Updated last year
- Capturing meaningful metrics in your Java application☆67Jul 26, 2024Updated last year
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,694Mar 7, 2026Updated 2 weeks ago
- Docker container orchestration platform☆2,211Sep 12, 2024Updated last year
- ☆13Apr 16, 2018Updated 7 years ago
- Mesos Integration Tests on Docker/Ec2☆15May 25, 2023Updated 2 years ago
- Scala Aggregators used for ML Model metrics monitoring☆92Sep 13, 2023Updated 2 years ago
- The user interface for Flyte☆43Jul 31, 2025Updated 7 months ago
- The Flyte data-sidecar that helps move the input and output data intelligently between containers☆10Oct 9, 2023Updated 2 years ago
- [SUNSET] Async Google Pubsub Client☆158Mar 18, 2023Updated 3 years ago
- A Giter8 template for scio☆31Feb 3, 2026Updated last month
- A cross platform CLI for Flyte. Written in Golang. Offers an intuitive interface to Flyte https://docs.flyte.org/projects/flytectl/en/lat…☆52May 23, 2024Updated last year
- Botoflow is an asynchronous framework for Amazon SWF that helps you build SWF applications using Python☆13Dec 26, 2022Updated 3 years ago
- Distributed Load generation Platform with Server Side Monitoring Capabilities.☆24Oct 30, 2015Updated 10 years ago
- A DataFusion-powered Serverless S3 Proxy.☆17Apr 15, 2024Updated last year
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,520Updated this week
- Iceberg is a table format for large, slow-moving tabular data☆490Apr 10, 2023Updated 2 years ago
- Kafka to Avro Writer based on Apache Beam. It's a generic solution that reads data from multiple kafka topics and stores it on in cloud s…☆25Apr 7, 2021Updated 4 years ago