A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW
☆24Mar 3, 2026Updated last month
Alternatives and similar repositories for spetlr
Users that are interested in spetlr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PySpark schema generator☆44Feb 23, 2023Updated 3 years ago
- AlgoTree☆16Jan 30, 2026Updated 2 months ago
- An efficient algorithm for k-bounded (Damerau-)Levenshtein distance☆16Oct 13, 2018Updated 7 years ago
- Certificate Transparency stuff☆18Jul 28, 2016Updated 9 years ago
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Javascript library to do Event Tracking using Google Analytics or Piwik☆10Mar 15, 2018Updated 8 years ago
- This is a simple script that parses python files in a directory and generates a mxfile containing a diagramm of classes, attributes and m…☆11Feb 23, 2023Updated 3 years ago
- An SBT Plugin that acts as a light wrapper around Buf.☆10Oct 29, 2024Updated last year
- A Nice .NET Wrapper for the MITIE Information Extraction Library☆19Jan 25, 2016Updated 10 years ago
- SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.☆26Feb 22, 2025Updated last year
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 2 months ago
- Library for reading the CASC storage format used in World of Warcraft: Warlords of Draenor, Diablo III and Heroes of the Storm.☆14Jan 24, 2017Updated 9 years ago
- Repository for code samples from the book Mastering Azure Analytics☆25Apr 10, 2017Updated 9 years ago
- Abstractions for feature engineering on large graphs of tabular data.☆26Apr 1, 2026Updated 2 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Public repository for .NET DevOps for Azure book manuscript☆26Jan 30, 2021Updated 5 years ago
- Trino On K8S Via Helm & Metastore Workshop Querying Delta Tables☆12Jan 27, 2025Updated last year
- ☆20Apr 3, 2026Updated last week
- A DuckDB extension to choose file interactively using native file open dialogs☆15Mar 30, 2026Updated 2 weeks ago
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11May 22, 2024Updated last year
- Implementation of core-expansion algorithm☆11Jan 26, 2026Updated 2 months ago
- A cross-platform command-line tool for effortlessly installing binaries from GitHub releases and other sources.☆38Mar 6, 2026Updated last month
- Bulk rename files with your favourite editor☆16Nov 12, 2025Updated 5 months ago
- Azure API Management Developer Portal Import and Export PowerShell scripts☆12Jan 19, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A high-performance, in-memory, git-backed OLAP database (of nothing).☆13Jan 23, 2025Updated last year
- A set of Build and Release tasks for Building, Deploying and Testing Databricks notebooks☆28Jul 2, 2024Updated last year
- Incan: a modern, Pythonic language that compiles to Rust! Type-safe, async-friendly, with fixtures, testing, and web/inter-op built in.☆16Apr 7, 2026Updated last week
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- Pytorch Implementation: Annealing Genetic GAN for Minority Oversampling (BMVC 2020)☆10Aug 5, 2020Updated 5 years ago
- Find your Claude Code level (0-10) and get a personalized roadmap to the next one. A skill for Claude Code by the GenAI Circle community.☆55Mar 23, 2026Updated 3 weeks ago
- Emacs interface to multitran.com☆18Feb 6, 2024Updated 2 years ago
- ☆27Apr 3, 2024Updated 2 years ago
- Python client for Radarly API☆10Aug 3, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Lightning-fast data validation for Rust. Built on Arrow/DataFusion with OpenTelemetry observability.☆31Mar 30, 2026Updated 2 weeks ago
- Custom otelcol-contrib with exporter to telegram. And handler for loguru☆14Apr 8, 2025Updated last year
- A seq2seq with attention dialogue/MT model implemented by TensorFlow.☆11Jul 17, 2018Updated 7 years ago
- A few hacky Python TensorFlow scripts that make frames for a video using Deep Dream☆10Oct 21, 2016Updated 9 years ago
- High performance Privacy By Design using Matryoshka and Spark talk code☆13May 21, 2019Updated 6 years ago
- ☆26Feb 22, 2026Updated last month
- Spark library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs☆14Jul 31, 2025Updated 8 months ago