Example of how to leverage Apache Spark distributed capabilities to call REST-API using a UDF
☆51Oct 11, 2022Updated 3 years ago
Alternatives and similar repositories for Spark-REST-API-UDF
Users that are interested in Spark-REST-API-UDF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python code that will collapse structured columns separating out the attributes into new columns☆10Mar 15, 2022Updated 4 years ago
- How to execute a REST API call in Apache Spark the right way, using Scala☆19Oct 11, 2022Updated 3 years ago
- Stream Data from Databricks Directly to PowerBI, and CosmosDB!☆12Sep 25, 2018Updated 7 years ago
- ☆10Jul 27, 2021Updated 4 years ago
- dagster scikit-learn pipeline example.☆46Mar 18, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Dec 9, 2022Updated 3 years ago
- Template for Scala Spark with Unit Test☆13Jul 24, 2023Updated 2 years ago
- A local copy of the jquery.wrapSelection plugin, which was not authored by me, but looks like it is abandoned.☆18Oct 11, 2014Updated 11 years ago
- ☆23Jun 3, 2021Updated 4 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Jan 20, 2023Updated 3 years ago
- Some recipes for data engineering with Python☆25Mar 23, 2021Updated 5 years ago
- Power BI REST API function wrappers for sending Spark data to Power BI Push Datasets☆15Apr 22, 2019Updated 7 years ago
- ☆13Oct 21, 2015Updated 10 years ago
- Google Developers Student Club - Data Science Bootcamp 2022☆11May 18, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)☆12May 2, 2021Updated 4 years ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated 2 years ago
- ☆41Jan 24, 2023Updated 3 years ago
- Python API for Deequ☆817Updated this week
- Steve's coffee shop recipe project for the Pluralsight Course "Git Fundamentals"☆20Mar 13, 2023Updated 3 years ago
- Speak Slack notifications and process Slack slash commands☆15Dec 20, 2018Updated 7 years ago
- Example code for the dbt core Learn tutorial. The Astro dbt provider, also known as Cosmos, is a tool automatically integrate dbt models …☆17Mar 7, 2025Updated last year
- A library for building data extraction jobs for ETH events☆18Jun 17, 2024Updated last year
- ☆16Jan 19, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Dataproc Scala Examples is an effort to assist in the creation of Spark jobs written in Scala to run on Dataproc.☆12Mar 26, 2026Updated last month
- a quick how-to on creating a library of custom Python functions for use in Databricks☆26Jul 10, 2020Updated 5 years ago
- SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.☆26Feb 22, 2025Updated last year
- Data sources for Elastic Map Service☆23Apr 20, 2026Updated last week
- Este é um projeto de exemplo que demonstra um processo de ETL (Extração, Transformação e Carga) de dados usando Python, Polars e AWS Loca…☆15Sep 25, 2023Updated 2 years ago
- learning logstash and elastic search plugins☆21Jul 15, 2022Updated 3 years ago
- ☆30Feb 5, 2023Updated 3 years ago
- Project utilising data from the Age of Empires api at 'https://aoestats.io'☆54Dec 8, 2024Updated last year
- Rock Solid Python with Type Hints Course Student Materials☆25Jul 8, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 4 years ago
- A benchmark for serverless analytic databases.☆26Jan 23, 2026Updated 3 months ago
- MCP Server for Apache Airflow☆32Oct 14, 2025Updated 6 months ago
- Distributed data sync using trimerge☆11Mar 26, 2024Updated 2 years ago
- Delta Lake helper methods in PySpark☆328Jan 19, 2026Updated 3 months ago
- dbt Cloud pipelines in airflow examples☆37Oct 30, 2023Updated 2 years ago
- ☆16Jun 24, 2023Updated 2 years ago