A curated list of awesome Databricks resources, including Spark
☆22Jun 28, 2024Updated last year
Alternatives and similar repositories for awesome-databricks
Users that are interested in awesome-databricks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A curated list of awesome Snowflake analytic data warehouse learning resources☆23Mar 1, 2021Updated 5 years ago
- Delta Lake Examples☆11Apr 24, 2020Updated 6 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- ☆13Mar 30, 2020Updated 6 years ago
- A repo containing code for a modern Docker + Jenkins CI / CD System☆15Aug 17, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This reference architecture demonstrates the use of AWS Step Functions to orchestrate an Extract Transfer Load (ETL) workflow with AWS La…☆24Jun 16, 2020Updated 6 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆12Jul 16, 2019Updated 6 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- ☆18Nov 9, 2025Updated 7 months ago
- Deep Learning Implementations☆17Dec 24, 2020Updated 5 years ago
- explore kafka, fs2 and pure functional programming in scala☆34Updated this week
- spark自学手册,包含了例 如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Jul 19, 2023Updated 2 years ago
- This is a collecton of CDK projects to show how to load data from streaming services into Amazon Redshift.☆13Sep 10, 2024Updated last year
- ☆24Apr 22, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The NYU Data Catalog facilitates researchers’ access to large datasets available either publicly or through institutional or individual l…☆29Nov 12, 2023Updated 2 years ago
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo…☆26May 27, 2021Updated 5 years ago
- Collection of Cloud Formation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a ReInvent Lab Exercis…☆26Apr 9, 2019Updated 7 years ago
- ☆45Sep 18, 2024Updated last year
- Scrapy exporter for Big Data formats☆16Mar 10, 2026Updated 3 months ago
- Atomic Scala Book Solutions - for Beginners and first time Functional Programmers☆12Mar 10, 2020Updated 6 years ago
- jenkins config as code, poc☆28Mar 18, 2018Updated 8 years ago
- Scala Academy☆11Jun 2, 2022Updated 4 years ago
- Secure shell command execution MCP server for Claude AI. Enables controlled shell access within specified directories.☆21Aug 19, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Civilian Topographic Map (CTM) product☆16Feb 28, 2025Updated last year
- ansible with kubernetes☆10Feb 14, 2023Updated 3 years ago
- On-demand port forwarding to k8s.☆26Apr 10, 2026Updated 2 months ago
- This a simple Python daemon to monitor your Impala nodes.☆10Apr 13, 2021Updated 5 years ago
- Source code for the website geminibyexample.com which provides simple Python code examples for the Gemini SDK☆24Apr 8, 2025Updated last year
- Examples of Spark 3.0☆45Nov 11, 2020Updated 5 years ago
- Reference Architectures for Apache Spark☆38Jan 23, 2017Updated 9 years ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆28Mar 17, 2026Updated 3 months ago
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat…☆34May 23, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A woodcut inspired map for city streets.☆11Apr 29, 2015Updated 11 years ago
- Set of Scripts and Documentation to setup Mac as Development Environment☆45Aug 6, 2023Updated 2 years ago
- Spark implementation of Slowly Changing Dimension type 2☆11Jan 8, 2019Updated 7 years ago
- This github repo contains Aurora MySQL and PostgreSQL Labs, Aurora Serverless Lab and Heterogeneous database migration with DMS Labs.☆35Mar 7, 2023Updated 3 years ago
- A downloader middleware to change user-agent of scrapy☆21Apr 13, 2026Updated 2 months ago
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot)☆12Dec 16, 2022Updated 3 years ago
- Distributed Data Systems with Azure Databricks, published by Packt☆12Jan 18, 2023Updated 3 years ago