Spark DataFrame transformation and UDF test examples
☆22Feb 13, 2023Updated 3 years ago
Alternatives and similar repositories for spark-test-example
Users that are interested in spark-test-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆455Apr 2, 2026Updated last month
- 2019 aws summit workshop content☆13Jul 2, 2019Updated 6 years ago
- ☆11May 16, 2022Updated 3 years ago
- Slide and notebook used for my talk on vaex at the Pandas summit 2019 @ Lodnon☆11Jun 13, 2019Updated 6 years ago
- ☆13Jan 23, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- sbt plugin to detect Akka module mismatches and fail build☆10Sep 15, 2025Updated 7 months ago
- ☆12Jan 22, 2025Updated last year
- Will come later...☆20Jul 1, 2022Updated 3 years ago
- A facebook for data☆26May 31, 2019Updated 6 years ago
- Algebird's HyperLogLog support for Apache Spark.☆10Jul 20, 2017Updated 8 years ago
- distributed remote code execution engine☆16Apr 6, 2025Updated last year
- ☆14Jul 14, 2022Updated 3 years ago
- Full Implementation of Recommender System in Pytorch (with examples)☆28Sep 2, 2020Updated 5 years ago
- ☆12Jul 10, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- JVM related exercises☆11Jul 16, 2017Updated 8 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…☆19Aug 16, 2019Updated 6 years ago
- Source code of the programs as values presentation☆20Aug 3, 2022Updated 3 years ago
- Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...☆19Dec 7, 2017Updated 8 years ago
- A library to query heterogeneous data sources uniformly using SPARQL☆12Dec 5, 2023Updated 2 years ago
- Template for Spark Projects☆104May 21, 2024Updated last year
- Rust-native, modular platform for Semantic Web, SPARQL 1.2, GraphQL, and AI-augmented reasoning☆57Updated this week
- 离线调度, hive, 任务依赖, 任务调度, 大数据开发平台☆14May 10, 2018Updated 7 years ago
- Utility library for Vert.X that allows using strong-typed interfaces in communication through EventBus☆16May 24, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Demonstrates how to develop an Oozie workflow application and aim's to show-case Oozie's features.☆32Apr 12, 2022Updated 4 years ago
- Mapless is a small framework for storing objects in a key->data fashion (i.e.: noSQL databases) without requiring any kind of object-data…☆10Feb 14, 2020Updated 6 years ago
- Usage examples for byte-genie API☆12Apr 27, 2024Updated 2 years ago
- Thin-client metrics library for use with Atlas and SpectatorD☆30Updated this week
- An Ansible collection for Cloudera Platform for on-premise and cloud Datahubs☆38Aug 26, 2025Updated 8 months ago
- Layout & typography for LaTeX books using the memoir document class☆10Aug 22, 2024Updated last year
- Implementing a domain model using functional programming in Scala.☆26Nov 6, 2020Updated 5 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 4 months ago
- A non-blocking Yahoo Finance Scala client☆23Oct 29, 2016Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A framework for creating user-friendly widgets and tools in Jupyter☆18Oct 10, 2024Updated last year
- An example Task Manager project that has been created using Lagom.☆18Mar 22, 2019Updated 7 years ago
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- Spark Streaming ETL jobs for Mozilla Telemetry☆18Dec 5, 2019Updated 6 years ago
- Creating wine embeddings and using these to produce wine recommendations☆22Aug 20, 2024Updated last year
- SIXX is an XML serializer/deserializer written in Smalltalk. The purpose is to store and load Smalltalk objects in a portable, dialect-in…☆15May 18, 2025Updated 11 months ago
- 🤞 A quick demonstration on how to promisify Chrome Extension APIs with ease!☆12Mar 10, 2019Updated 7 years ago