Delta Lake Documentation
☆53Jun 19, 2024Updated last year
Alternatives and similar repositories for delta-docs
Users that are interested in delta-docs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Delta Lake Website☆26Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications☆45Jan 27, 2024Updated 2 years ago
- Construindo Pipeline de Dados com Astro Python SDK, dbt & Apache Airflow☆10Mar 20, 2024Updated 2 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Genie Framework improves Spark Pool utilization by executing multiple Synapse notebooks on the same spark pool instance☆28Dec 19, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Custom PySpark Connectors☆92Mar 3, 2026Updated 3 weeks ago
- Model Context Protocol (MCP) server to interact with gRPC services using the grpcurl tool☆16Mar 5, 2025Updated last year
- Using WASM to write UDFs in Apache Spark☆12Jun 3, 2024Updated last year
- A containerized approach using Apache Kafka, Spark, Cassandra, Hive, Jupyter, and Docker-compose.☆14Apr 14, 2021Updated 4 years ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆428May 5, 2025Updated 10 months ago
- ☆16Apr 1, 2025Updated 11 months ago
- ☆15May 31, 2023Updated 2 years ago
- ☆13Feb 19, 2025Updated last year
- native Go library for Delta Lake☆10Jul 31, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is a showcase repository for the multi-genie agent solution☆24Feb 22, 2026Updated last month
- A CLI tool to simulate EC2 Spot Instances interruptions using AWS Fault Injection Simulator.☆12May 4, 2022Updated 3 years ago
- Hackerrank, Coursera, other studies☆13Aug 19, 2021Updated 4 years ago
- A Gentle introduction to Machine Learning with Apache Spark☆11Mar 2, 2026Updated 3 weeks ago
- Arrow Flight SQL Server☆131Jun 21, 2025Updated 9 months ago
- BSR's new public API. Currently in development.☆21Jan 26, 2026Updated last month
- Sample scripts to use with Agentic Document Extraction (ADE).☆38Mar 18, 2026Updated last week
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- ☆37Jun 8, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆18Aug 6, 2024Updated last year
- PySpark test helper methods with beautiful error messages☆755Feb 25, 2026Updated last month
- streaming eight subreddits from reddit api using kafka producer & spark structured streaming.☆19Mar 18, 2026Updated last week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆227Mar 11, 2026Updated last week
- GCP Plugin for Gordon: Event-driven Cloud DNS☆12Apr 5, 2023Updated 2 years ago
- Parquet file management in S3 for Athena / Spectrum / Presto partitioning☆22Jan 27, 2025Updated last year
- This repository is all you need to understand how to build Gen AI products or AI agents☆60Feb 4, 2026Updated last month
- Azure Data Factory Cookbook_Second Edition, published by Packt☆19Feb 29, 2024Updated 2 years ago
- ☆22Sep 21, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆61Feb 1, 2025Updated last year
- An incubating Debezium CDC connector for for IBM i (AS/400). Please log issues at https://github.com/debezium/dbz/issues.☆18Updated this week
- ☆12Jul 22, 2025Updated 8 months ago
- Using OpenAI with Databricks SQL for queries in natural language☆22Dec 3, 2025Updated 3 months ago
- ☆13Jan 6, 2026Updated 2 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Mar 9, 2026Updated 2 weeks ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated last year