how to unit test your PySpark code
☆29 · Mar 26, 2021 · Updated 5 years ago
Alternatives and similar repositories for unitTestPySpark
Users that are interested in unitTestPySpark are comparing it to the libraries listed below.
- simple ETL example ☆16 · Jun 1, 2020 · Updated 5 years ago
- Agent models implemented with Pyro ☆11 · Jul 11, 2023 · Updated 2 years ago
- Sample Project to Learn Data Engineering ☆10 · Aug 1, 2021 · Updated 4 years ago
- Sample code for getting started reverse-terraforming Snowflake ☆17 · May 12, 2023 · Updated 2 years ago
- Template for Data Engineering and Data Pipeline projects ☆118 · Jan 1, 2023 · Updated 3 years ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more ☆17 · May 1, 2023 · Updated 2 years ago
- Some example projects for Data Engineers to build, end-to-end. ☆39 · Nov 8, 2023 · Updated 2 years ago
- Hadoop/Hive/Spark container to perform CI tests ☆10 · Dec 26, 2020 · Updated 5 years ago
- ☆10 · Nov 28, 2022 · Updated 3 years ago
- A well-documented explanation of data structure types including Linked List, Hash table, Binary Tree, Queues, Stack ☆13 · Jul 30, 2022 · Updated 3 years ago
- ☆22 · Nov 30, 2022 · Updated 3 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on. ☆24 · Apr 27, 2023 · Updated 3 years ago
- Docker Apache Airflow ☆13 · Mar 1, 2023 · Updated 3 years ago
- ☆10 · Oct 12, 2022 · Updated 3 years ago
- ☆11 · Jan 24, 2023 · Updated 3 years ago
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects. ☆19 · Nov 28, 2023 · Updated 2 years ago
- ☆21 · Mar 11, 2025 · Updated last year
- Genie Framework improves Spark Pool utilization by executing multiple Synapse notebooks on the same spark pool instance ☆28 · Dec 19, 2023 · Updated 2 years ago
- A CALDERA plugin ☆18 · Jul 28, 2020 · Updated 5 years ago
- A backtest a day keeps the losses away! ☆15 · Sep 11, 2023 · Updated 2 years ago
- ☆15 · Apr 8, 2026 · Updated 3 weeks ago
- Continuous infrastructure drift detection with historical tracking and notifications. ☆57 · Updated this week
- ☆26 · Jul 9, 2023 · Updated 2 years ago
- Awesome cheatsheets for Data Science ☆12 · Sep 16, 2019 · Updated 6 years ago
- A lightweight and flexible analysis pipeline ☆12 · Apr 9, 2026 · Updated 2 weeks ago
- Python project template for Snowpark development ☆81 · Oct 23, 2023 · Updated 2 years ago
- NeurIPS 2020 Spotlight Paper ☆13 · Dec 20, 2021 · Updated 4 years ago
- create issues from pytest-reportlog files ☆14 · Feb 10, 2026 · Updated 2 months ago
- Library for fast text representation and classification. ☆10 · Apr 17, 2022 · Updated 4 years ago
- This is a custom project for WGU, the original project repo is https://github.com/udacity/nd0821-c2-build-model-workflow-starter ☆12 · Feb 1, 2026 · Updated 2 months ago
- Design and implementation of FAIR Data Cube ☆11 · Jun 2, 2025 · Updated 10 months ago
- LLM Building Blocks for Python Course ☆17 · Nov 17, 2025 · Updated 5 months ago
- Repository for participants of the "Containers for HPC" training ☆11 · Apr 9, 2026 · Updated 2 weeks ago
- NIU website on common software problems and their troubleshooting ☆10 · Feb 13, 2026 · Updated 2 months ago
- mlmodels : Machine Learning and Deep Learning Model ZOO for Pytorch, Tensorflow, Keras, Gluon models... ☆10 · Oct 23, 2020 · Updated 5 years ago
- Furnace is a high-performance quantitative trading library that provides features similar to CCXT, allowing developers to connect and int… ☆14 · Jan 16, 2025 · Updated last year
- Collection of Snowflake Stored Procedures and UDFs that leverage Python ☆21 · Sep 4, 2023 · Updated 2 years ago
- This project looks at creating a controlled vocabulary for DICOM Pt 6 Data Dictionary with a focus on CS code strings. ☆12 · Jan 9, 2026 · Updated 3 months ago
- The AI Alliance project to define a reference stack for AI model and system evaluation, with evaluations, benchmarks, and leaderboards. ☆13 · Apr 6, 2026 · Updated 3 weeks ago