Example Repo to have full end to end pyspark testing via docker-compose
☆31Feb 6, 2023Updated 3 years ago
Alternatives and similar repositories for pyspark-testing-env
Users that are interested in pyspark-testing-env are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The "What Would Brett Do?" VS Code extension☆15Jul 12, 2023Updated 2 years ago
- Implementation of Boundary Attributions for Normal (Vector) Explanations☆11Aug 13, 2021Updated 4 years ago
- Data Engineer Roadmaps as Projects Funnel☆11Aug 10, 2022Updated 3 years ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- A production-ready PySpark project template with medallion architecture, Python packaging, unit tests, integration tests, CI/CD automatio…☆56Feb 20, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Easily import a module and mock its dependencies in an isolated way.☆13May 19, 2022Updated 3 years ago
- Data science, machine learning tools on the cloud☆15Jan 13, 2021Updated 5 years ago
- Data pipeline project using Data Factory, Databricks and Cosmosdb Graph, deployed using Azure DevOps, secured using firewalls and Azure A…☆11Dec 14, 2022Updated 3 years ago
- Deploy a scikit model using heroku and Flask☆15May 1, 2023Updated 2 years ago
- Match your fig size and font to conference formats.☆11Aug 16, 2021Updated 4 years ago
- Example project for building scalable data pipelines with Kedro and Ibis.☆14Dec 10, 2025Updated 3 months ago
- A pyproject.toml conversion tool for Poetry to uv migration☆20Dec 28, 2024Updated last year
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th…☆24Apr 1, 2026Updated last week
- A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation an…☆23Nov 21, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- quadipy is a python package to help transform structured data into RDF graph format☆19Apr 14, 2023Updated 2 years ago
- ☆17May 26, 2025Updated 10 months ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆29Nov 24, 2022Updated 3 years ago
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo…☆26May 27, 2021Updated 4 years ago
- Validates that all require statements in a project point to an existing path and are correctly cased.☆20Mar 30, 2014Updated 12 years ago
- Nomad launcher/executor for Dagster☆21Oct 2, 2025Updated 6 months ago
- 🍪 Cookiecutter template for MLOps Project. Based on: https://mlops-guide.github.io/☆28May 7, 2021Updated 4 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆286Mar 4, 2026Updated last month
- Making Databricks easy to use for R developers.☆26Oct 6, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Streamlit template for building SMART on FHIR apps in the Cerner ecosystem.☆11Sep 22, 2023Updated 2 years ago
- Making Time Speak! 🎙️☆29Mar 30, 2026Updated last week
- ☆22Apr 10, 2017Updated 8 years ago
- Utility functions to support analytics over FHIR in BigQuery or Apache Spark☆15Jan 8, 2024Updated 2 years ago
- Code repository for the paper "A Deep Adversarial Framework for Visually Explainable Periocular Recognition" - CVPR 2021 Biometrics Works…☆16Feb 7, 2025Updated last year
- This Guidance demonstrates how to transform architecture diagrams into Infrastructure as Code (IaC) templates using AI, addressing the ch…☆40Apr 1, 2026Updated last week
- Demo converting streamlit uber nyc rides to use duckdb☆30Apr 9, 2023Updated 3 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆43Dec 4, 2023Updated 2 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database☆14Oct 26, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Playing with different packages of the Apache Spark☆30Feb 8, 2026Updated 2 months ago
- ☆13Apr 8, 2023Updated 3 years ago
- Chapter 8 of the AWS Cookbook☆12Apr 20, 2023Updated 2 years ago
- ☆12Feb 23, 2024Updated 2 years ago
- Over 100K open-source YARA signatures evaluated against over 280K files to give insights into the performance of each YARA rule.☆27Dec 13, 2022Updated 3 years ago
- Sample repository to demonstrate Terraform module versioning using semantic-release.☆18Jul 5, 2021Updated 4 years ago
- Homeworks repository for the Big Data Analysis with Scala and Spark Coursera course☆15Jun 30, 2024Updated last year