Custom PySpark Connectors
☆89 Mar 2, 2026 Updated this week
Alternatives and similar repositories for pyspark-data-sources
Users that are interested in pyspark-data-sources are comparing it to the libraries listed below
- ☆13 Feb 22, 2024 Updated 2 years ago
- End-to-end Azure Databricks Workspace automation with Azure Pipelines ☆23 Sep 10, 2023 Updated 2 years ago
- Visits sessionization pipeline used for the talk ☆13 May 28, 2024 Updated last year
- Examples of Prompt Engineering, Zero Shot Learning, Few Shot Learning and Retrieval Augmented Generation (RAG) using Hugging Face, Databr… ☆16 Sep 21, 2023 Updated 2 years ago
- Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables ☆94 Dec 22, 2025 Updated 2 months ago
- A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT … ☆50 Dec 7, 2022 Updated 3 years ago
- Delta Lake Documentation ☆53 Jun 19, 2024 Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆228 Feb 11, 2026 Updated 3 weeks ago
- Proxy solution to run elegant Web UIs or interact with LLMs natively inside databricks notebooks. ☆29 Sep 24, 2024 Updated last year
- Don't Panic. This guide will help you when it feels like the end of the world. ☆30 Feb 7, 2026 Updated last month
- Deploy models quickly to databricks via mlflow based serving infra. ☆33 Jul 23, 2025 Updated 7 months ago
- PDF DataSource for Apache Spark; allows reading PDF files directly into a DataFrame and running OCR on them ☆79 Apr 27, 2025 Updated 10 months ago
- Databricks GPU Model Serving Example Scripts ☆32 Sep 28, 2023 Updated 2 years ago
- Demo project for dbt on Databricks ☆32 Oct 23, 2020 Updated 5 years ago
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline ☆152 Aug 14, 2024 Updated last year
- 🌩️ The Deep Learning framework based on Lightning ☆11 Dec 11, 2025 Updated 2 months ago
- SQL Queries & Alerts for Databricks System Tables access.audit Logs ☆41 Jul 24, 2025 Updated 7 months ago
- Public Benefits Studio's Document Extractor to automate document data extraction with AI and OCR. ☆13 Jun 2, 2025 Updated 9 months ago
- Configuration system geared towards Python ML projects ☆11 Apr 30, 2023 Updated 2 years ago
- ☆19 Updated this week
- This library can communicate with the latest Bestway app version (smartspa) ☆29 Jan 28, 2026 Updated last month
- Integration of Iceberg table management into Spark SQL ☆11 Jan 21, 2020 Updated 6 years ago
- Application for checking the performance of an elevator group system in a building using simulation. ☆12 Nov 9, 2017 Updated 8 years ago
- A Table format agnostic data sharing framework ☆42 Feb 4, 2024 Updated 2 years ago
- Collection of Machine Learning Examples for Azure Databricks ☆42 Nov 11, 2020 Updated 5 years ago
- ☆10 Dec 10, 2022 Updated 3 years ago
- ☆11 Nov 10, 2025 Updated 3 months ago
- Ansible module that makes it easy to add, remove, or set mount options in /etc/fstab. ☆12 Apr 21, 2022 Updated 3 years ago
- Prompting Techniques for Attorneys ☆14 Updated this week
- ☆15 Nov 21, 2023 Updated 2 years ago
- A Python Library to support running data quality rules while the spark job is running⚡ ☆200 Feb 27, 2026 Updated last week
- The CaltechDATA InvenioRDM source code ☆10 Jan 28, 2026 Updated last month
- Prefect integrations with Microsoft Planetary Computer. ☆11 Jul 15, 2024 Updated last year
- RobotFramework All in One installer ☆15 Updated this week
- A go implementation of the TOSCA Standard from OASIS (YAML version) ☆10 Mar 6, 2018 Updated 8 years ago
- Download GitHub repositories ☆12 May 10, 2025 Updated 9 months ago
- Advanced parsing of structured data using Python's new match statement ☆13 Jan 15, 2025 Updated last year
- Ansible galaxy role for Duo ☆14 Nov 30, 2015 Updated 10 years ago
- A protoc plugin that generates GraphQL execution code from Protocol Buffers ☆38 Updated this week