Resilient data pipeline framework running on Apache Spark
☆26 · Updated this week
Alternatives and similar repositories for pramen
Users who are interested in pramen are comparing it to the libraries listed below
- Dynamic Conformance Engine ☆32 · Oct 17, 2025 · Updated 4 months ago
- Extensible streaming ingestion pipeline on top of Apache Spark ☆46 · Jul 17, 2025 · Updated 7 months ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark ☆29 · Nov 4, 2024 · Updated last year
- Scala API for Apache Spark SQL high-order functions ☆14 · Aug 4, 2023 · Updated 2 years ago
- Friendly, Scala like, Sequence interface ☆12 · Jan 13, 2026 · Updated last month
- Efficiently automate your release note generation with 'generate-release-notes'. This GH action scans your target GitHub repository's iss… ☆12 · Updated this week
- Avro SerDe for Apache Spark structured APIs. ☆241 · Jun 10, 2025 · Updated 8 months ago
- replace '__all__' with '@public.add' decorator ☆15 · Dec 3, 2020 · Updated 5 years ago
- A COBOL parser and Mainframe/EBCDIC data source for Apache Spark ☆158 · Updated this week
- R COBOL DI (Data Integration) Package: Import COBOL CopyBook data files directly into R as properly structured data frames. ☆15 · Aug 7, 2024 · Updated last year
- Python stream processing with RisingWave ☆21 · Jan 29, 2026 · Updated last month
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines ☆17 · Jan 21, 2020 · Updated 6 years ago
- A multi-platform file-configurable folder comparison tool with html-reporting written in rust ☆12 · Feb 13, 2026 · Updated 2 weeks ago
- Helpers & syntactic sugar for PySpark. ☆62 · Dec 4, 2025 · Updated 2 months ago
- ☆30 · Jul 2, 2024 · Updated last year
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark. ☆76 · Apr 24, 2024 · Updated last year
- Nested array transformation helper extensions for Apache Spark ☆37 · Aug 4, 2023 · Updated 2 years ago
- dbc is the command-line tool for installing and managing ADBC drivers ☆89 · Updated this week
- IBM z/OS core collection via FTP ☆11 · Sep 17, 2025 · Updated 5 months ago
- Implementation of core-expansion algorithm ☆11 · Jan 26, 2026 · Updated last month
- ☆34 · Updated this week
- Common utilities for Apache Kafka ☆36 · Aug 7, 2023 · Updated 2 years ago
- ☆10 · Jan 28, 2025 · Updated last year
- CLI app / pre-commit hook to clean Jupyter Notebooks metadata, execution_count and optionally output. ☆11 · Mar 3, 2025 · Updated 11 months ago
- A parser for IBM JCL. ☆15 · Oct 25, 2019 · Updated 6 years ago
- Kafka Connect JSONata Transform ☆12 · Feb 24, 2025 · Updated last year
- JCL to script generate ☆10 · Jun 3, 2024 · Updated last year
- An SBT Plugin that acts as a light wrapper around Buf. ☆10 · Oct 29, 2024 · Updated last year
- Pekko Streams support for JSON via Circe ☆10 · Nov 10, 2025 · Updated 3 months ago
- Transactional Machine Learning using Data Streams and AutoML ☆14 · Oct 5, 2025 · Updated 4 months ago
- ☆15 · Jul 25, 2025 · Updated 7 months ago
- ☆12 · Mar 7, 2025 · Updated 11 months ago
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes. ☆12 · Oct 25, 2024 · Updated last year
- An open source accent conversion model based on the real time voice cloning repository ☆12 · May 10, 2024 · Updated last year
- Incan: a modern, Pythonic language that compiles to Rust! Type-safe, async-friendly, with fixtures, testing, and web/inter-op built in. ☆12 · Updated this week
- General-purpose implementations of ERC-792 Arbitrables. ☆12 · Jun 21, 2024 · Updated last year
- A Bun plugin which converts .csv and .tsv files into JavaScript modules. ☆17 · Nov 12, 2024 · Updated last year
- Parser combinator library for Elixir ☆13 · Feb 11, 2026 · Updated 2 weeks ago
- Rust SDK for Claude Code CLI - Build production-ready AI agents with type safety ☆19 · Oct 24, 2025 · Updated 4 months ago