Wrangler Transform: A DMD system for transforming Big Data
☆106Feb 12, 2026Updated 2 weeks ago
Alternatives and similar repositories for wrangler
Users who are interested in wrangler are comparing it to the libraries listed below
- Cask Hydrator Plugins Repository☆68Dec 19, 2025Updated 2 months ago
- Database plugins☆13Feb 2, 2026Updated last month
- CDAP UI☆20Feb 23, 2026Updated last week
- CDAP Kubernetes Operator☆19Nov 19, 2025Updated 3 months ago
- An open source framework for building data analytic applications.☆785Updated this week
- A collection of Google Cloud Platform (GCP) plugins☆49Feb 23, 2026Updated last week
- Cloud Spanner Connector for Apache Spark☆17Updated this week
- ☆14May 24, 2017Updated 8 years ago
- This project shows how you can create a reporting dashboard in an Electron.js app that runs on multiple platforms.☆20Aug 26, 2020Updated 5 years ago
- Hive Storage Handler for interoperability between BigQuery and Apache Hive☆19Jan 29, 2025Updated last year
- An embedded job scheduler.☆117Jul 29, 2024Updated last year
- ☆21Aug 26, 2025Updated 6 months ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆18Updated this week
- Form Plugin☆22Apr 7, 2023Updated 2 years ago
- Data abstraction, storage, discovery, and serving system☆35Jan 30, 2026Updated last month
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆25Oct 16, 2020Updated 5 years ago
- Example of how you can generate docx document based on template docx with dynamic header row generation, dynamic data rows generation and…☆24Feb 3, 2018Updated 8 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆29Nov 4, 2024Updated last year
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 2 years ago
- Codeless end-to-end testing framework which makes your testing easier☆44Updated this week
- Google Data Studio connector designed for the Open Data Kit OData API.☆11May 18, 2022Updated 3 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.☆289Updated this week
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆1,113Jan 12, 2023Updated 3 years ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆303Oct 30, 2025Updated 4 months ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆421Feb 19, 2026Updated last week
- ☆31Oct 17, 2018Updated 7 years ago
- Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer, with load balancing and rule-based forwarding of Hive JDBC Client requests to HiveServer2 and Spark ThriftServer.☆33Apr 12, 2022Updated 3 years ago
- Metl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file b…☆213Feb 9, 2026Updated 3 weeks ago
- Using the Draw.io tool to create diagrams of Terraform deployments.☆10Dec 18, 2025Updated 2 months ago
- ☆11Apr 22, 2022Updated 3 years ago
- EncryCore node reference implementation☆15Apr 2, 2020Updated 5 years ago
- ☆11Mar 1, 2021Updated 5 years ago
- MaxiCP☆17Updated this week
- Example Java JUnit consumer☆12Feb 23, 2026Updated last week
- X-definition 4.2 (Open Source Software)☆15Updated this week
- ☆17May 22, 2025Updated 9 months ago
- Java Text-based and Scanned PDF data extraction via stream/lattice/OCR-hybrid in Tabular form.☆28Jan 28, 2026Updated last month