PastorGL/datacooker-etl

ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included

☆16

Alternatives and similar repositories for datacooker-etl

Users that are interested in datacooker-etl are comparing it to the libraries listed below.

sleepymast / hugedbbench
Huge list of database benchmarks: aerospike, clickhouse, cockroach, cockroachdb, cratedb, memsql, mysql, nuodb, postgresql, redis, scylladb…
☆35 · May 19, 2025 · Updated last year
n-surkov / PySparkPipeline
Module for pipelines concept in PySpark
☆16 · Mar 27, 2024 · Updated 2 years ago
woltapp / spark-osm-datasource
Native Spark OSM PBF data source
☆18 · Apr 3, 2024 · Updated 2 years ago
MTSWebServices / onetl
One ETL tool to rule them all
☆89 · Updated this week
functionaljustin / duct
DUCT is a Scala 3 category theory and functional programming library
☆15 · Dec 31, 2025 · Updated 6 months ago
larsbaunwall / DomainLang
A DSL for Domain-Driven Design
☆18 · Feb 1, 2026 · Updated 5 months ago
battermann / mcs
Nested Monte Carlo tree search for SameGame
☆14 · Sep 4, 2019 · Updated 6 years ago
skoonData / docker-compose
☆12 · Jul 27, 2021 · Updated 4 years ago
zhanghua19830528 / cim-basic-platform-backend
CIM basic development platform backend, based on the RuoYi framework; BIM+GIS
☆11 · May 25, 2022 · Updated 4 years ago
cosminseceleanu / scala-pipeline
Pipeline Pattern implementation in Scala
☆12 · Mar 18, 2018 · Updated 8 years ago
rc-dukes / dash2
Real-time motion planner and autonomous vehicle simulator in the browser, built with WebGL and Three.js.
☆13 · Jun 25, 2026 · Updated last month
jexxa-projects / Jexxa
Jexxa - A Ports and Adapters Framework for Java
☆14 · Updated this week
bytedeco / javacpp-embedded-python
With this library, you can embed Python in your Java or Scala project. The main purpose of this library is to use Python libraries from J…
☆12 · Aug 25, 2024 · Updated last year
rockthejvm / spark-cluster-docker
☆10 · Mar 12, 2021 · Updated 5 years ago
quarkusio / quarkus-web-lab
CMS with a Markdown Editor and comments
☆10 · Sep 9, 2024 · Updated last year
luque / Notes--Versioning-Event-Sourced-System
Notes about the "Versioning in an Event Sourced System" book by Greg Young.
☆16 · Jun 13, 2017 · Updated 9 years ago
dust-ai-mr / dust-core
A lightweight Actor framework for Java 21 and above. Library of idiomatic Actors to create and manage millions of cooperating Java threa…
☆28 · Jul 18, 2026 · Updated last week
JerckyLY / mapboxgl-measure-tools
Measurement controls based on mapboxgl, mapboxgl-draw, and turf
☆12 · Nov 22, 2022 · Updated 3 years ago
dkandalov / tictactoe4k
Source code and slides for the tictactoe4k talk
☆12 · May 2, 2023 · Updated 3 years ago
polyzos / kafka-streaming-ledger
☆13 · Jan 23, 2023 · Updated 3 years ago
treyhakanson / 2019-pyohio-luigi
2019 PyOhio talk and code sample on spotify/luigi
☆11 · Aug 14, 2023 · Updated 2 years ago
OtusTeam / data-engineer
☆12 · May 19, 2021 · Updated 5 years ago
s7oev / spd
Systematic Program Design (all 3 parts)
☆11 · Jun 6, 2020 · Updated 6 years ago
haidaoxiaofei / GPS2RoadNetwork
Reading list on using vehicle-generated GPS data to update road networks
☆14 · Jul 18, 2018 · Updated 8 years ago
boxfuse / cloudwatchlogs-agent
AWS CloudWatch Logs Agent written in Go with zero runtime dependencies
☆12 · Oct 7, 2016 · Updated 9 years ago
DenysMoskalenko / vt2pbf
☆15 · Mar 2, 2026 · Updated 4 months ago
sky-cloak / locke
Open Source Identity and Access Management. A Keycloak distribution with a Redis cache backend (an alternative to embedded Infinispan).
☆23 · Jul 10, 2026 · Updated 2 weeks ago
databricks-industry-solutions / hls-payer-mrf-sparkstreaming
Spark Structured Streaming for Payer MRF use case
☆15 · Nov 20, 2025 · Updated 8 months ago
dmlogv / airflow-tutorial
Learning resources for Airflow Tutorial article.
☆56 · Jul 22, 2020 · Updated 6 years ago
Ananyaiitbhilai / KGViz
[KGC '24] This application is for visualisation of Knowledge Graphs. We employ a novel technique which uses an LLM-based agent for triple e…
☆11 · Apr 17, 2024 · Updated 2 years ago
rusexpertiza-llc / yupana
BigData analytics platform
☆19 · Updated this week
amelinvladimir / clickhouse_course
☆17 · Apr 17, 2026 · Updated 3 months ago
zekeriyyaa / Traffic-Data-Analysis-with-Apache-Spark-Based-on-Mobile-Robot-Data
Mobile robot data were analyzed with Apache Spark to extract five different statistical results, such as travel time, waiting time, average…
☆15 · Apr 5, 2022 · Updated 4 years ago
fuxiuzhan / fuled-component
fuled-framework component package: an out-of-the-box scaffold that includes a registry center, configuration center, trace, metric, dynamic thread pool, dynamic Kafka, dynamic Redis, and other common mainstream features
☆21 · May 21, 2026 · Updated 2 months ago
DanteLore / bdd-pyspark
Simple demo using "behave" and "pyspark" libraries to test data transformations in a human-readable way
☆10 · Apr 5, 2019 · Updated 7 years ago
attogram / shared-media-tagger
Crowdsourced ratings website for freely licensed images and media from Wikimedia Commons. PHP, SQLite, MediaWiki API, Curators backend.
☆10 · Jul 12, 2026 · Updated last week
Amoryan / DracoDemo
Uses Google Draco to decompress .drc files into PLY or OBJ files, then renders the decompressed files with OpenGL to display a 3D ring
☆13 · Apr 4, 2019 · Updated 7 years ago
fugerit-org / fj-doc
Venus - Fugerit Document Generation Framework (fj-doc)
☆30 · Updated this week
pmauduit / mapnik-java
Calls libmapnik3.0.x from Java
☆15 · May 8, 2026 · Updated 2 months ago