Example Repo to have full end to end pyspark testing via docker-compose
☆33Feb 6, 2023Updated 3 years ago
Alternatives and similar repositories for pyspark-testing-env
Users that are interested in pyspark-testing-env are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 25+ end-to-end ml case studies in python - classification, NLP, clustering, CNNs, SQL.☆18Mar 29, 2025Updated last year
- ☆13Sep 2, 2024Updated last year
- Data Engineer Roadmaps as Projects Funnel☆12Aug 10, 2022Updated 3 years ago
- Turn browser clicks into reproducible scraping code.☆11Oct 27, 2024Updated last year
- Data science, machine learning tools on the cloud☆15Jan 13, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Data pipeline project using Data Factory, Databricks and Cosmosdb Graph, deployed using Azure DevOps, secured using firewalls and Azure A…☆11Dec 14, 2022Updated 3 years ago
- TensorFlow implementation of the "Prompt-to-Prompt Image Editing with Cross Attention Control" for Stable Diffusion☆15Mar 25, 2023Updated 3 years ago
- Deploy a scikit model using heroku and Flask☆15May 1, 2023Updated 3 years ago
- Samples for fabric user data functions☆28Jun 18, 2026Updated last week
- Match your fig size and font to conference formats.☆11Aug 16, 2021Updated 4 years ago
- Example project for building scalable data pipelines with Kedro and Ibis.☆14Dec 10, 2025Updated 6 months ago
- For use receiving email via SES, delivers email to S3, indexes mailboxes in DDB and broadcasts rich inbound email events via EventBridge.☆24Jun 15, 2026Updated 2 weeks ago
- A pyproject.toml conversion tool for Poetry to uv migration☆20Dec 28, 2024Updated last year
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th…☆25Jun 11, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation an…☆24Nov 21, 2023Updated 2 years ago
- A simple script designed to run and use i2p and i2pd on tails os along with the tor network!☆22May 19, 2025Updated last year
- HIVE: Evaluating the Human Interpretability of Visual Explanations (ECCV 2022)☆22Jan 19, 2023Updated 3 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆29Nov 24, 2022Updated 3 years ago
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo…☆26May 27, 2021Updated 5 years ago
- Enrolled in DataTalks Zoomcamp https://github.com/DataTalksClub/mlops-zoomcamp☆20Jun 27, 2022Updated 4 years ago
- Lightweight container-based desktop compartmentalization.☆32May 7, 2026Updated last month
- ☆18May 22, 2024Updated 2 years ago
- Making Time Speak! 🎙️☆29May 30, 2026Updated 3 weeks ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Streamlit template for building SMART on FHIR apps in the Cerner ecosystem.☆11Sep 22, 2023Updated 2 years ago
- ☆22Apr 10, 2017Updated 9 years ago
- Utility functions to support analytics over FHIR in BigQuery or Apache Spark☆15Jan 8, 2024Updated 2 years ago
- For Udemy students: the official repository of Rock the JVM's Spark Streaming course☆26Jan 5, 2023Updated 3 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database☆14Oct 26, 2021Updated 4 years ago
- ☆13Apr 8, 2023Updated 3 years ago
- Chapter 8 of the AWS Cookbook☆12Apr 20, 2023Updated 3 years ago
- ☆12Feb 23, 2024Updated 2 years ago
- Sample repository to demonstrate Terraform module versioning using semantic-release.☆18Jul 5, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆45Sep 27, 2018Updated 7 years ago
- Alpine-based multistage-build version of Python Black for reproducible usage in CI☆24Sep 5, 2023Updated 2 years ago
- ☆24Jul 16, 2024Updated last year
- Homeworks repository for the Big Data Analysis with Scala and Spark Coursera course☆15Jun 30, 2024Updated last year
- Singer.io transformation component between Taps and Targets - PipelineWise compatible☆20Sep 20, 2024Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 3 years ago
- ☆26Sep 27, 2022Updated 3 years ago