Example of how to leverage Apache Spark's distributed capabilities to call a REST API using a UDF
☆51 · Oct 11, 2022 · Updated 3 years ago
Alternatives and similar repositories for Spark-REST-API-UDF
Users that are interested in Spark-REST-API-UDF are comparing it to the libraries listed below.
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab. ☆15 · Sep 3, 2021 · Updated 4 years ago
- A Python PySpark Project with Poetry ☆31 · May 2, 2026 · Updated 2 weeks ago
- Concurrently list Amazon S3 bucket ☆38 · Apr 20, 2025 · Updated last year
- A Python library for the Alation REST APIs. ☆10 · May 11, 2026 · Updated last week
- ☆10 · Jul 27, 2021 · Updated 4 years ago
- Get introduced to Directed Acyclic Graphs (DAGs) through Dagster with a simple ML program ☆13 · Apr 19, 2023 · Updated 3 years ago
- Next word prediction based on N-gram language model ☆12 · Jan 11, 2015 · Updated 11 years ago
- ☆17 · Dec 9, 2022 · Updated 3 years ago
- Template for Scala Spark with Unit Test ☆13 · Jul 24, 2023 · Updated 2 years ago
- Code for a tutorial for basic concepts working with Akka using Scala. ☆21 · Mar 29, 2013 · Updated 13 years ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 · Jan 20, 2023 · Updated 3 years ago
- Some recipes for data engineering with Python ☆25 · Mar 23, 2021 · Updated 5 years ago
- ☆13 · Oct 21, 2015 · Updated 10 years ago
- A Docker setup using Airflow with the Hadoop ecosystem (Hive, Spark, and Sqoop) ☆12 · May 2, 2021 · Updated 5 years ago
- This is a collection of simple asynchronous RESToverHTTP and JSON-RPCoverWebSocket examples of how to interact with a few Crypto Exchange… ☆11 · May 30, 2024 · Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow. ☆27 · Mar 25, 2024 · Updated 2 years ago
- Python API for Deequ ☆820 · May 9, 2026 · Updated last week
- Steve's coffee shop recipe project for the Pluralsight Course "Git Fundamentals" ☆21 · Mar 13, 2023 · Updated 3 years ago
- Speak Slack notifications and process Slack slash commands ☆15 · Dec 20, 2018 · Updated 7 years ago
- Example code for the dbt core Learn tutorial. The Astro dbt provider, also known as Cosmos, is a tool automatically integrate dbt models … ☆17 · Mar 7, 2025 · Updated last year
- ☆16 · Jan 19, 2022 · Updated 4 years ago
- Python wrapper for the Open Brewery DB API ☆16 · Mar 7, 2024 · Updated 2 years ago
- ☆14 · Mar 30, 2026 · Updated last month
- A quick how-to on creating a library of custom Python functions for use in Databricks ☆26 · Jul 10, 2020 · Updated 5 years ago
- SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine. ☆26 · Feb 22, 2025 · Updated last year
- A micro cluster lab to experiment Dask and Spark (Python and Scala) based on Docker ☆16 · Mar 7, 2023 · Updated 3 years ago
- Learning logstash and elastic search plugins ☆21 · Jul 15, 2022 · Updated 3 years ago
- Use Celery (an asynchronous task queue) with a schedule to read a file and print ☆12 · Sep 10, 2021 · Updated 4 years ago
- ☆30 · Feb 5, 2023 · Updated 3 years ago
- Small configuration for the home server. ☆24 · Dec 27, 2022 · Updated 3 years ago
- Project utilising data from the Age of Empires api at 'https://aoestats.io' ☆54 · Dec 8, 2024 · Updated last year
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath… ☆22 · Feb 3, 2026 · Updated 3 months ago
- Source Code for the video series on developing a pushups logger web application with CRUD and user authentication features using Flask. ☆29 · Sep 23, 2022 · Updated 3 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks ☆24 · Apr 2, 2022 · Updated 4 years ago
- A benchmark for serverless analytic databases. ☆26 · Jan 23, 2026 · Updated 3 months ago
- ☆13 · Jul 15, 2022 · Updated 3 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke… ☆20 · Aug 12, 2025 · Updated 9 months ago
- Distributed data sync using trimerge ☆11 · Mar 26, 2024 · Updated 2 years ago
- Code files for Mastering JBoss Drools 6, published by Packt ☆11 · Sep 12, 2023 · Updated 2 years ago