johnmuller87/spark-udf

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/johnmuller87/spark-udf)

johnmuller87 / spark-udf

☆34

Alternatives and similar repositories for spark-udf

Users that are interested in spark-udf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amesar / spark-python-scala-udf
View on GitHub
Demonstrates calling a Scala UDF from Python using spark-submit with an EGG and JAR
☆23Mar 3, 2020Updated 6 years ago
kevenpinto / spark-serverless-repo-example
View on GitHub
☆15Mar 24, 2022Updated 4 years ago
boom-deva / Teaching_Advanced_SQL
View on GitHub
Teaching notes from my Advanced SQL workshops as local lead instructor at General Assembly New York. The first edition was created for th…
☆19Feb 14, 2020Updated 6 years ago
analyticsdurgesh / StreamCommerce-Lakehouse-360
View on GitHub
Production-style real-time e-commerce lakehouse with Kafka, Airflow, Databricks, Medallion architecture, data quality, quarantine, Terraf…
☆31May 30, 2026Updated last month
zheyuan28 / SparkTaskMetrics
View on GitHub
Task Metrics Explorer
☆14Apr 2, 2019Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
gyan42 / spark-streaming-playground
View on GitHub
Full Stack Data Science projects centered around Apache Spark Streaming for educational purpose.
☆19May 1, 2023Updated 3 years ago
mcastellin / yt-docker-tricks-examples
View on GitHub
A repository to store example files and projects for my YouTube series **Docker Development Tips & Tricks**
☆13Dec 1, 2021Updated 4 years ago
GoogleCloudPlatform / dataflow-opinion-analysis
View on GitHub
Opinion Analysis of News, Threaded Conversations, and User Generated Content
☆110Sep 19, 2024Updated last year
lucharo / raceplotly
View on GitHub
High level package to make a chart bar plot using plotly.
☆28Nov 16, 2022Updated 3 years ago
oap-project / sql-ds-cache
View on GitHub
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37Jan 3, 2023Updated 3 years ago
ngaude / cdiscount
View on GitHub
☆12Oct 6, 2015Updated 10 years ago
joelgrus / fun-with-trump-tweets
View on GitHub
code for Seattle Twitter-Dev Meetup, October 2016
☆13Oct 26, 2016Updated 9 years ago
HyukjinKwon / pyspark-project-example
View on GitHub
A simple example for PySpark based project.
☆11Jun 3, 2016Updated 10 years ago
mihagrabner / Load-Forecasting-tutorial
View on GitHub
Complementary Jupyter notebooks for load forecasting tutorial.
☆12May 28, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fivetran / dbt_smart_run
View on GitHub
Run your dbt models efficiently using dbt_smart_run
☆17Mar 5, 2025Updated last year
avannaldas / ML-End-to-End
View on GitHub
Bare minimum End-to-End ML application with Flask REST API Prediction Service
☆11Jul 11, 2020Updated 6 years ago
wrannaman / tensorflow-pickup-lines
View on GitHub
A pickup line generator using Tensorflow
☆18Jun 25, 2017Updated 9 years ago
ActivisionGameScience / python-kafka-benchmark
View on GitHub
☆14Jun 15, 2016Updated 10 years ago
Rabia-Shafiq / R_Programming
View on GitHub
This is a repository for any and all code written for the R Programming Coursera course through Johns Hopkins University.
☆29Dec 27, 2015Updated 10 years ago
Mangarella / Kaggle-CreditCardFraud
View on GitHub
Ensemble Learning Techniques Tutorial with Credit Card Fraud
☆10Oct 22, 2017Updated 8 years ago
Spark-clustering-notebook / coliseum
View on GitHub
Project defining the docker image that will support examples of algorithms created in this organization
☆13Oct 22, 2017Updated 8 years ago
mosegui / mahalanobis
View on GitHub
Python package for calculation mahalanobis distances from NumPy arrays
☆14Jun 22, 2022Updated 4 years ago
rohitash-chandra / CMTL_dynamictimeseries
View on GitHub
Coevolutionary Multi-task learning for Dynamic Time Series prediction
☆14Jul 13, 2021Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
hichamjanati / mtw
View on GitHub
Wasserstein regularization for sparse multi-task regression
☆15Jul 26, 2020Updated 5 years ago
code4kunal / eda-python-examples
View on GitHub
Consists all the try outs and assignments in AIML program of great lakes
☆10Jun 11, 2020Updated 6 years ago
WalePhenomenon / MathsForML
View on GitHub
Mathematics for Data Science and Machine Learning Workshop Materials
☆12Mar 11, 2021Updated 5 years ago
adib0073 / Time_Series_Anomaly_Detection
View on GitHub
Effective Approaches for Time Series Anomaly Detection
☆11Jun 6, 2020Updated 6 years ago
rvgramillano / springboard_portfolio
View on GitHub
Portfolio repository for work done in Springboard's Data Science Career Track
☆11Apr 1, 2019Updated 7 years ago
datasci-w266 / 2022-summer-main
View on GitHub
☆10Jul 28, 2022Updated 3 years ago
mids-w205-fund-of-data-eng / docker-images
View on GitHub
docker images for class
☆10Jul 27, 2021Updated 4 years ago
appnexus / logistic-regression-L1
View on GitHub
☆15May 10, 2016Updated 10 years ago
timesler / lr-momentum-scheduler
View on GitHub
Pytorch implementation of arbitrary learning rate and momentum schedules, including the One Cycle Policy
☆12Jul 15, 2020Updated 6 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
aqzhyi / wordpress-posts-crawler
View on GitHub
A crawler for articles of wordpress
☆14Nov 11, 2015Updated 10 years ago
ernest-kiwele / chicago-crime-analysis-apache-spark
View on GitHub
Using Apache Spark SQL, Spark ML, Pandas to analyse and predict using the Chicago crime dataset
☆10Apr 6, 2018Updated 8 years ago
rogeriochaves / notebooks
View on GitHub
I'll munch some data here
☆12Jun 18, 2021Updated 5 years ago
buds-lab / united-world-college-open-data
View on GitHub
An IPython notebook analysis of the UWC Tampines commercial building dataset
☆13Apr 25, 2019Updated 7 years ago
joelparkerhenderson / wordbooks
View on GitHub
Demo wordbooks for business, projects, industries, software, consulting, and more
☆25Apr 14, 2025Updated last year
sanori / unzip-mbcs
View on GitHub
UnZip for non-UTF8 encoding such as cp949, sjis, gbk, euc-kr, euc-jp, and gb2312
☆14Jul 17, 2022Updated 4 years ago
csuzhangxc / Flask-BCS
View on GitHub
百度云存储BCS（Baidu Cloud Storage）Flask扩展，BCS(Baidu Cloud Storage) for Flask
☆11Feb 5, 2015Updated 11 years ago