SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.
☆26Feb 22, 2025Updated last year
Alternatives and similar repositories for tsumugi-spark
Users that are interested in tsumugi-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 11 months ago
- Cl app / pre-commit hook to clean Jupyter Notebooks metadata, execution_count and optionally output.☆11Mar 3, 2025Updated last year
- Delta lake and filesystem helper methods☆51Feb 29, 2024Updated 2 years ago
- ☆23Oct 8, 2018Updated 7 years ago
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Unity Catalog Explorer is a TypeScript + Next.js based Web UI for the Unity Catalog OSS.☆13Jun 29, 2024Updated last year
- An SBT Plugin that acts as a light wrapper around Buf.☆10Oct 29, 2024Updated last year
- A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW☆24Mar 3, 2026Updated 3 months ago
- A Python PySpark Projet with Poetry☆31May 2, 2026Updated last month
- Reproducible Research in Finse☆10Aug 5, 2020Updated 5 years ago
- ☆14Nov 24, 2025Updated 6 months ago
- Allow parsing Russian receipts☆54Aug 14, 2020Updated 5 years ago
- A DuckDB extension to choose file interactively using native file open dialogs☆15May 27, 2026Updated 2 weeks ago
- Example code representing a real-life use case for using {arrow} to improve a Shiny application☆17Jul 6, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Oct 12, 2023Updated 2 years ago
- Browse GitHub repos without cloning☆61Jun 1, 2026Updated last week
- [ECML-PKDD 2025] Official Implementation of "Trajectory Imputation in Multi-Agent Sports with Derivative-Accumulating Self-Ensemble".☆14Jun 20, 2025Updated 11 months ago
- This is a Model Context Protocol (MCP) server that provides professional cycling data from FirstCycling. It allows you to retrieve inform…☆18Aug 26, 2025Updated 9 months ago
- Atomix Jepsen tests☆14Feb 7, 2017Updated 9 years ago
- nosqlapi is a library for building standard NOSQL python libraries.☆12Apr 5, 2022Updated 4 years ago
- Bulk rename files with your favourite editor☆16Nov 12, 2025Updated 6 months ago
- Our public repo ranked 1st 🏆🏆 at MMSports2023 challenge on segmentation task☆16Oct 31, 2023Updated 2 years ago
- Redash plugin for Apache Kylin integration☆12Mar 21, 2018Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This package contains the grammar in ANTLR g4 format and Java parser for the Data Quality Definition Language (DQDL), used by AWS Glue Da…☆23May 19, 2026Updated 3 weeks ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆96May 11, 2026Updated 3 weeks ago
- ☆23Aug 26, 2024Updated last year
- Particle collision with quad-tree experiment inspired by games like Eufloria and Auralux.☆13Oct 30, 2020Updated 5 years ago
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated 2 years ago
- Pytorch Implementation: Annealing Genetic GAN for Minority Oversampling (BMVC 2020)☆10Aug 5, 2020Updated 5 years ago
- Find your Claude Code level (0-10) and get a personalized roadmap to the next one. A skill for Claude Code by the GenAI Circle community.☆63Mar 23, 2026Updated 2 months ago
- Sangria akka-streams integration☆11Apr 18, 2026Updated last month
- A Databricks framework for quick Agent solutions☆23Apr 14, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Geospatial visualization for Apache Zeppelin using the Leaflet map library.☆12Dec 11, 2017Updated 8 years ago
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Feb 22, 2025Updated last year
- Common Lisp New Language Reference☆21Mar 11, 2026Updated 2 months ago
- Emacs interface to multitran.com☆18Feb 6, 2024Updated 2 years ago
- Python client for Radarly API☆10Aug 3, 2023Updated 2 years ago
- Custom otelcol-contrib with exporter to telegram. And handler for loguru☆14Apr 8, 2025Updated last year
- Universal Character Recognizer (UCR): Simple, Intuitive, Extensible, Multi-Lingual OCR engine☆15Apr 23, 2021Updated 5 years ago