A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW
☆24Mar 3, 2026Updated 3 months ago
Alternatives and similar repositories for spetlr
Users that are interested in spetlr are comparing it to the libraries listed below.
- Semantic Functions for Semantic Link☆15Apr 29, 2026Updated last month
- Streaming demo dbt☆17Sep 17, 2024Updated last year
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Feb 22, 2025Updated last year
- An efficient algorithm for k-bounded (Damerau-)Levenshtein distance☆16Oct 13, 2018Updated 7 years ago
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- An SBT Plugin that acts as a light wrapper around Buf.☆10Oct 29, 2024Updated last year
- ☆76Jun 12, 2024Updated 2 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 4 months ago
- Repository for code samples from the book Mastering Azure Analytics☆25Apr 10, 2017Updated 9 years ago
- Abstractions for feature engineering on large graphs of tabular data.☆26May 18, 2026Updated 3 weeks ago
- Trino On K8S Via Helm & Metastore Workshop Querying Delta Tables☆12Jan 27, 2025Updated last year
- ☆21May 26, 2026Updated 2 weeks ago
- A DuckDB extension to choose file interactively using native file open dialogs☆15May 27, 2026Updated 2 weeks ago
- A cookbook of sample SatchelJS code☆16Jan 3, 2024Updated 2 years ago
- CLI app / pre-commit hook to clean Jupyter Notebooks metadata, execution_count and optionally output.☆11Mar 3, 2025Updated last year
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11May 22, 2024Updated 2 years ago
- pyspark framework☆25Feb 22, 2022Updated 4 years ago
- Implementation of core-expansion algorithm☆11Jan 26, 2026Updated 4 months ago
- ✨ A Pydantic to PySpark schema library☆127May 24, 2026Updated 3 weeks ago
- various airflow plugins☆19Dec 26, 2022Updated 3 years ago
- A project to design a fact and dimension star schema for optimizing queries on a flight booking database using PostgreSQL, a relational d…☆12Aug 15, 2021Updated 4 years ago
- nosqlapi is a library for building standard NOSQL python libraries.☆12Apr 5, 2022Updated 4 years ago
- Bulk rename files with your favourite editor☆16Nov 12, 2025Updated 7 months ago
- ☆25May 1, 2024Updated 2 years ago
- This package contains the grammar in ANTLR g4 format and Java parser for the Data Quality Definition Language (DQDL), used by AWS Glue Da…☆23May 19, 2026Updated 3 weeks ago
- A high-performance, in-memory, git-backed OLAP database (of nothing).☆12Jan 23, 2025Updated last year
- Incan: a modern, Pythonic language that compiles to Rust! Type-safe, async-friendly, with fixtures, testing, and web/inter-op built in.☆28Updated this week
- Sample app for a Python API using FastAPI and neomodel☆12Jul 1, 2024Updated last year
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated 2 years ago
- Pytorch Implementation: Annealing Genetic GAN for Minority Oversampling (BMVC 2020)☆10Aug 5, 2020Updated 5 years ago
- Common Lisp New Language Reference☆22Mar 11, 2026Updated 3 months ago
- Emacs interface to multitran.com☆18Feb 6, 2024Updated 2 years ago
- A Python Library to support running data quality rules while the spark job is running⚡☆202May 19, 2026Updated 3 weeks ago
- ☆26Apr 3, 2024Updated 2 years ago
- Lightning-fast data validation for Rust. Built on Arrow/DataFusion with OpenTelemetry observability.☆31Mar 30, 2026Updated 2 months ago
- A seq2seq with attention dialogue/MT model implemented by TensorFlow.☆11Jul 17, 2018Updated 7 years ago
- A few hacky Python TensorFlow scripts that make frames for a video using Deep Dream☆10Oct 21, 2016Updated 9 years ago
- Universal Character Recognizer (UCR): Simple, Intuitive, Extensible, Multi-Lingual OCR engine☆15Apr 23, 2021Updated 5 years ago
- ☆26Feb 22, 2026Updated 3 months ago