Spark DataFrame transformation and UDF test examples
☆22Feb 13, 2023Updated 3 years ago
Alternatives and similar repositories for spark-test-example
Users that are interested in spark-test-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆455Apr 2, 2026Updated 2 weeks ago
- http4s integration with fs2-data☆12Updated this week
- 2019 aws summit workshop content☆13Jul 2, 2019Updated 6 years ago
- sbt plugin to detect Akka module mismatches and fail build☆10Sep 15, 2025Updated 7 months ago
- Will come later...☆20Jul 1, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Generates dummy data for Reaction Commerce☆10Jan 20, 2021Updated 5 years ago
- Database plugins☆13Apr 6, 2026Updated last week
- ☆14Jul 14, 2022Updated 3 years ago
- Benchmarking suite for Apache Spark☆15Nov 24, 2017Updated 8 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…☆19Aug 16, 2019Updated 6 years ago
- Notes on "Data Science from Scratch" by Joel Grus☆11Aug 9, 2016Updated 9 years ago
- Rust-native, modular platform for Semantic Web, SPARQL 1.2, GraphQL, and AI-augmented reasoning☆50Mar 28, 2026Updated 2 weeks ago
- Source code of the programs as values presentation☆20Aug 3, 2022Updated 3 years ago
- A library to query heterogeneous data sources uniformly using SPARQL☆12Dec 5, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Template for Spark Projects☆103May 21, 2024Updated last year
- 离线调度, hive, 任务依赖, 任务调度, 大数据开发平台☆14May 10, 2018Updated 7 years ago
- ☆21Sep 27, 2022Updated 3 years ago
- Statically analyze sources and extract information about called or exported library functions in Python applications☆21Apr 25, 2024Updated last year
- Utility library for Vert.X that allows using strong-typed interfaces in communication through EventBus☆15May 24, 2023Updated 2 years ago
- The ISC Anomaly Detection and Classification Framework implemented for Apache Flink.☆13Dec 14, 2016Updated 9 years ago
- Usage examples for byte-genie API☆12Apr 27, 2024Updated last year
- Contains the content for Tableau's OSS contribution guidelines☆11Nov 24, 2025Updated 4 months ago
- Source code of the programs as values presentation☆22Oct 22, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Thin-client metrics library for use with Atlas and SpectatorD☆30Apr 7, 2026Updated last week
- Layout & typography for LaTeX books using the memoir document class☆10Aug 22, 2024Updated last year
- Some AWS EMR examples☆16Jan 18, 2018Updated 8 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 3 months ago
- An example Task Manager project that has been created using Lagom.☆18Mar 22, 2019Updated 7 years ago
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud…☆29Feb 14, 2026Updated 2 months ago
- Scala etcd client implementing V3 APIs☆31Feb 15, 2026Updated 2 months ago
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- Spark Streaming ETL jobs for Mozilla Telemetry☆18Dec 5, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Details about the rust-embedded-community project☆15Jun 25, 2025Updated 9 months ago
- SIXX is an XML serializer/deserializer written in Smalltalk. The purpose is to store and load Smalltalk objects in a portable, dialect-in…☆15May 18, 2025Updated 10 months ago
- Uses Cloud Build to deploy a scalable batch ingestion pipeline consisting of GCS, Cloud Functions, Dataflow and BigQuery☆22Dec 7, 2022Updated 3 years ago
- Microsoft Azure PaaS 인 WebApp 에 Django 배포☆10Jul 26, 2016Updated 9 years ago
- DDD-centric event-sourcing library for the JVM☆16Apr 30, 2023Updated 2 years ago
- 🤞 A quick demonstration on how to promisify Chrome Extension APIs with ease!☆12Mar 10, 2019Updated 7 years ago
- ODQA Baseline 팀프로젝트 이슈/정보 저장용 레포입니다.☆12May 22, 2021Updated 4 years ago