maropu / spark-data-repair-pluginView external linksLinks
Provide functionality to build statistical models to repair dirty tabular data in Spark
☆12Apr 21, 2023Updated 2 years ago
Alternatives and similar repositories for spark-data-repair-plugin
Users that are interested in spark-data-repair-plugin are comparing it to the libraries listed below
Sorting:
- 🚀 Validation DSL for data pipelines☆24Jun 12, 2018Updated 7 years ago
- TSG Client is a Python library for interacting with the TNO Security Gateway (TSG) Core Container☆18Mar 28, 2025Updated 10 months ago
- FederatedCatalog☆11Updated this week
- This project aims at doing performance testing of AWS Kinesis stream☆11May 16, 2020Updated 5 years ago
- GenericSpark☆10Jun 12, 2015Updated 10 years ago
- Just in Time Datastructures☆11Feb 21, 2017Updated 8 years ago
- Hadoop/Hive/Spark container to perform CI tests☆10Dec 26, 2020Updated 5 years ago
- ☆10Oct 31, 2019Updated 6 years ago
- Websocket Test Steps Plugin for Ready! API and SoapUI☆11Mar 14, 2025Updated 10 months ago
- Visualize column-level data lineage in Spark SQL☆92May 13, 2022Updated 3 years ago
- Maven packaging and lifecycle for Trino plugins☆15Jan 26, 2026Updated 2 weeks ago
- Latest: 7.0.0 - Lightweight and ready-to-use services to easily connect an IDS-Connector to different IDS-Infrastructure-Components.☆14Mar 4, 2024Updated last year
- Just a simple script that crawls Spotify tracks of a given Mixcloud cloudcast.☆13Feb 28, 2015Updated 10 years ago
- Infrastructure to run programs written in high-level languages on top of the Database Stream Processor (DBSP) runtime.☆16Jun 17, 2022Updated 3 years ago
- C 结构体与 JSON 快速互转库☆11Nov 27, 2017Updated 8 years ago
- MySQL® migration tool☆13Dec 12, 2025Updated 2 months ago
- A reasonably complete and well-tested golang port of httpbin, with zero dependencies outside the go stdlib.☆11Nov 24, 2025Updated 2 months ago
- Atlassian Bamboo and Bitbucket images for GKE clusters☆10Mar 24, 2022Updated 3 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Data generator for Amazon MSK☆18May 7, 2024Updated last year
- Repository for the OAC (ODRL profile for Access Control) documentation: https://w3id.org/oac☆10Oct 20, 2024Updated last year
- ☆16Updated this week
- Simple Go 1.8 plugin test for https://jeremywho.com/go-1.8---plugins/☆10Feb 28, 2017Updated 8 years ago
- A reference implementation for the did:webs DID method specified here https://github.com/trustoverip/tswg-did-method-webs-specification. …☆13Oct 28, 2024Updated last year
- A simple controller to help create mirrors on Kubernetes☆10Oct 1, 2022Updated 3 years ago
- Sangria monix integration☆10Feb 2, 2026Updated last week
- Smoke (flame) chart library for D3.js users☆28Jun 21, 2020Updated 5 years ago
- ☆11Oct 17, 2016Updated 9 years ago
- A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO …☆10Jul 7, 2022Updated 3 years ago
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 5 years ago
- smbus provides access to the System Management bus over I2C☆15Dec 16, 2020Updated 5 years ago
- Policy Administration point to handle ODRL policies and provide their Rego-equivalent to the Open Policy Agent☆11Feb 5, 2026Updated last week
- JVMCI examples for Java Day Tokyo 2017☆10Sep 30, 2019Updated 6 years ago
- Scala library for testing in production☆10Dec 9, 2020Updated 5 years ago
- Infra stuff to run Kubernetes on travisci☆10Mar 7, 2023Updated 2 years ago
- A complete golang implementation of Common industrial protocol☆10Dec 26, 2020Updated 5 years ago
- JDBC Driver for Treasure Data☆11May 1, 2024Updated last year
- Repository of the metadata specification mobilityDCAT-AP☆18Jan 22, 2026Updated 3 weeks ago
- Theo dõi biến động giá sản phẩm TIKI với Github Actions☆14Jan 16, 2022Updated 4 years ago