Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
☆97May 20, 2026Updated last week
Alternatives and similar repositories for flowman
Users that are interested in flowman are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library enabling DAG structuring of data processing programs such as ETLs☆17Apr 13, 2026Updated last month
- ☆11May 16, 2022Updated 4 years ago
- Create a data mart using Azure Data Factory as ELT / ETL, Azure Synapse as database and Power BI as visualization tool.☆19Apr 20, 2022Updated 4 years ago
- A Python Library to support running data quality rules while the spark job is running⚡☆202May 19, 2026Updated last week
- Set of ETL utils for Spark☆15May 4, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code examples for the Introduction to Kubeflow course☆15Jan 12, 2021Updated 5 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆160Dec 10, 2022Updated 3 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆588Jan 24, 2024Updated 2 years ago
- Kubernetes operator for Apache Hadoop HDFS used by the Stackable Data Platform☆52Updated this week
- Observability Python library - Powered by Kensu☆22Oct 15, 2024Updated last year
- ☆12Jul 10, 2022Updated 3 years ago
- Template to deploy a Data Product for data stream processing into a Data Landing Zone of the Data Management & Analytics Scenario (former…☆36Jul 17, 2023Updated 2 years ago
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago
- Kafka Kubernetes Authenticator and Authorizer☆12Sep 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- In-Memory Java Compiler☆12Oct 13, 2020Updated 5 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- A map transformer which implements the `Stream Maps` capability from Meltano's tap and target SDK: https://sdk.meltano.com/☆19Updated this week
- ZIO wrapper for AWS S3 SDK async client☆11Feb 21, 2020Updated 6 years ago
- Custom XML and JSON marshallers for Grails in an easy way☆30Oct 18, 2016Updated 9 years ago
- ☆12Dec 2, 2025Updated 5 months ago
- React Bootstrap 4 Tabs Component