SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.
☆26Feb 22, 2025Updated last year
Alternatives and similar repositories for tsumugi-spark
Users that are interested in tsumugi-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 9 months ago
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11May 22, 2024Updated last year
- Delta lake and filesystem helper methods☆50Feb 29, 2024Updated 2 years ago
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- A collection of useful and awesome Databricks resources☆19Dec 21, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆13Apr 24, 2023Updated 2 years ago
- A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW☆24Mar 3, 2026Updated last month
- A Python PySpark Projet with Poetry☆29Feb 17, 2026Updated last month
- Bayesian experiments for football insights☆15Jan 4, 2026Updated 3 months ago
- Trino On K8S Via Helm & Metastore Workshop Querying Delta Tables☆12Jan 27, 2025Updated last year
- Allow parsing Russian receipts☆54Aug 14, 2020Updated 5 years ago
- optasoccer is a Python library for reading opta soccer data☆11Mar 14, 2024Updated 2 years ago
- ☆20Updated this week
- A DuckDB extension to choose file interactively using native file open dialogs☆15Mar 30, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Unofficial implementation of QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition.☆64Oct 15, 2022Updated 3 years ago
- ☆17Oct 12, 2023Updated 2 years ago
- Implementation of core-expansion algorithm☆11Jan 26, 2026Updated 2 months ago
- A cross-platform command-line tool for effortlessly installing binaries from GitHub releases and other sources.☆38Mar 6, 2026Updated last month
- Atomix Jepsen tests☆14Feb 7, 2017Updated 9 years ago
- nosqlapi is a library for building standard NOSQL python libraries.☆12Apr 5, 2022Updated 4 years ago
- A simple Java thread dump visualisation and analysis tool.☆12Jan 29, 2016Updated 10 years ago
- Redash plugin for Apache Kylin integration☆12Mar 21, 2018Updated 8 years ago
- ☆10Jul 1, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Notebooks to learn Databricks Lakehouse Platform☆42Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡☆201Updated this week
- This package contains the grammar in ANTLR g4 format and Java parser for the Data Quality Definition Language (DQDL), used by AWS Glue Da…☆22Mar 26, 2026Updated 2 weeks ago
- A high-performance, in-memory, git-backed OLAP database (of nothing).☆13Jan 23, 2025Updated last year
- Metric Learning Library for Keras☆10Apr 24, 2019Updated 6 years ago
- Incan: a modern, Pythonic language that compiles to Rust! Type-safe, async-friendly, with fixtures, testing, and web/inter-op built in.☆16Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆95May 9, 2025Updated 11 months ago
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- Particle collision with quad-tree experiment inspired by games like Eufloria and Auralux.☆12Oct 30, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Pytorch Implementation: Annealing Genetic GAN for Minority Oversampling (BMVC 2020)☆10Aug 5, 2020Updated 5 years ago
- Sangria akka-streams integration☆11Feb 8, 2026Updated 2 months ago
- A Databricks framework for quick Agent solutions☆24Apr 14, 2024Updated last year
- Geospatial visualization for Apache Zeppelin using the Leaflet map library.☆12Dec 11, 2017Updated 8 years ago
- Find your Claude Code level (0-10) and get a personalized roadmap to the next one. A skill for Claude Code by the GenAI Circle community.☆55Mar 23, 2026Updated 2 weeks ago
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Feb 22, 2025Updated last year
- Catalyst.Neuro☆20Oct 3, 2023Updated 2 years ago