rcongiu/spark-udwf-session

A Spark custom window function example that generates session IDs

☆19

Alternatives and similar repositories for spark-udwf-session

Users interested in spark-udwf-session are comparing it to the repositories listed below.

frankmcsherry / graph-map
A library for working with mmap'd graph data
☆11 · Nov 29, 2020 · Updated 5 years ago
oreillymedia / Learning-Path-Get-Started-with-Natural-Language-Processing-Using-Python-Spark-and-Scala
Links to example code downloads for Learning Path: Get Started with Natural Language Processing Using Python, Spark, and Scala
☆16 · Feb 23, 2017 · Updated 9 years ago
xavient / CDS
Content Data Store (HDFS/HBase)
☆13 · Dec 1, 2016 · Updated 9 years ago
BD2KGenomics / conductor
Efficient, distributed downloads of large files from S3 to HDFS using Spark.
☆17 · Apr 26, 2017 · Updated 9 years ago
ippontech / metrics-spark-reporter
Dropwizard Metrics reporter for Apache Spark
☆28 · Dec 22, 2014 · Updated 11 years ago
bakdata / rebalancing-demo
Repository that showcases problems with Kafka rebalancing and explains how to fix them. Please visit our blog article to learn what Kafka…
☆12 · Aug 21, 2020 · Updated 5 years ago
saagie / spark-or
Spark Operations Research
☆12 · Sep 21, 2016 · Updated 9 years ago
sforteln / HdfsBlockFinder
Lets you see which datanodes contain a file in HDFS
☆17 · Mar 16, 2013 · Updated 13 years ago
pkoperek / hubert
Universal gEnetic pRogramming Tool (hUbERT)
☆13 · Jun 7, 2017 · Updated 9 years ago
idryanov / 2schematic
Converts 3D file formats to Minecraft schematics
☆14 · Mar 8, 2013 · Updated 13 years ago
Xeeshanmalik / deep_ml_esn
My Very Own Deep Multiple Layered Echo State Network
☆13 · Jan 2, 2021 · Updated 5 years ago
anish749 / spark2-etl-examples
A project with examples of using a few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
☆26 · Aug 5, 2021 · Updated 4 years ago
ekg / mmmulti
memory mapped multimap, multiset, and implicit interval tree based on an in-place parallel sort
☆28 · Jan 25, 2021 · Updated 5 years ago
KhronosGroup / NNEF-Docs
NNEF public repository
☆18 · Nov 20, 2025 · Updated 8 months ago
bbende / nifi-dependency-example
Demonstrates how to link a processor bundle with a custom controller service.
☆21 · Aug 9, 2024 · Updated last year
conda-forge / jaxlib-feedstock
A conda-smithy repository for jaxlib.
☆17 · Jul 3, 2026 · Updated 2 weeks ago
tobilg / docker-predictionio
Docker container for the latest prediction.io version with most recent dependencies
☆11 · Jan 6, 2017 · Updated 9 years ago
vzlatkin / MonteCarloVarUsingRealData
☆14 · May 29, 2016 · Updated 10 years ago
mozilla-services / tigerblood
Deprecated, use https://github.com/mozilla-services/iprepd
☆15 · May 18, 2018 · Updated 8 years ago
EqualExperts / opslogger
A logging library designed with operations in mind.
☆12 · Dec 6, 2017 · Updated 8 years ago
legsem / legstar-core2
Mainframe COBOL and Java open world integration
☆21 · Jun 20, 2026 · Updated last month
flexgp / flexgp
FlexGP: Flexible ML with Genetic Programming
☆20 · Feb 17, 2015 · Updated 11 years ago
TFMV / arrowport
Arrow Flight Powered DuckDB Bridge
☆15 · Jun 20, 2025 · Updated last year
abrander / ginproxy
A very simple proxy handler for gin-gonic
☆12 · Feb 3, 2016 · Updated 10 years ago
Paxa / kt
Docker image for "kt" - kafka command line tool
☆18 · Aug 18, 2022 · Updated 3 years ago
sciencepal / dockers
Code for docker images
☆39 · Apr 12, 2023 · Updated 3 years ago
jeff-dale / Gene-Expression-Programming
Python + Numpy implementation of the Gene Expression Programming Evolutionary Algorithm
☆11 · Sep 18, 2017 · Updated 8 years ago
cloudwicklabs / generator
Synthetic data generators for simulating real-time data and work loads
☆12 · Nov 6, 2015 · Updated 10 years ago
FizzerWL / CsScala
C# to Scala Converter
☆21 · Jun 27, 2026 · Updated 3 weeks ago
pac4j / jee-pac4j-demo
JEE demo to test the jee-pac4j security library
☆19 · Updated this week
RobertSasak / Prolog-Planning-Library
Prolog is a suitable environment for writing planners. However, there is no way to work with PDDL files so far. In this library we provi…
☆20 · Jul 26, 2021 · Updated 4 years ago
code-forge-temple / scribe-pal
ScribePal is an Open Source intelligent browser extension that leverages AI to empower your web experience by providing contextual insigh…
☆22 · Apr 6, 2026 · Updated 3 months ago
inconshreveable / service
Run go programs as a service on major platforms.
☆16 · Aug 11, 2015 · Updated 10 years ago
Brett-Kennedy / AdditiveDecisionTree
A variation on a standard Decision Tree such as that in sklearn, where nodes may be based on an aggregation of multiple splits.
☆10 · May 24, 2024 · Updated 2 years ago
msgis / openmetadata-spatial-connector
This is an OpenMetadata custom connector for any spatial data format that can be read through fiona (the OGR part of the excellent GDAL li…
☆21 · Sep 12, 2024 · Updated last year
victoriano / bluesky-social-graph
A simple script to download your BlueSky social graph and visualize it in Graphext
☆20 · Sep 15, 2024 · Updated last year
oracle / spark-oracle
On-the-fly translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.
☆36 · Apr 15, 2025 · Updated last year
mraad / spark-gdb
A library for parsing and querying an Esri File Geodatabase with Apache Spark.
☆27 · Nov 13, 2016 · Updated 9 years ago
force12io / coreos-marathon
Create a 3 node Marathon / Mesos cluster locally with Vagrant or hosted with Packet
☆20 · Aug 31, 2015 · Updated 10 years ago