jerzygangi/forklift

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jerzygangi/forklift)

jerzygangi / forklift

🚚 ETL for Spark and Airflow

☆25

Alternatives and similar repositories for forklift

Users that are interested in forklift are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bartosz25 / acid-file-formats
View on GitHub
Code for Apache Hudi, Apache Iceberg and Delta Lake analysis
☆10Feb 2, 2024Updated 2 years ago
huydx / fulltext_engine
View on GitHub
simple inverted index full text search engine written in python
☆13Oct 3, 2013Updated 12 years ago
nineinchnick / trino-faker
View on GitHub
Trino plugin that generates fake data
☆14Oct 31, 2024Updated last year
Dataring-engineering / mcp-server-trino
View on GitHub
MCP Server for Trino
☆18Apr 22, 2025Updated last year
adamw / fp-stack-2020-pres
View on GitHub
☆13Dec 12, 2020Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
xavient / Data-Ingestion-Platform
View on GitHub
☆51Updated this week
saumitras / kafka-twitter-docker
View on GitHub
Example to show how to deploy kafka dependent scala microservice with docker
☆15Nov 28, 2017Updated 8 years ago
flowers2023 / spark-streaming
View on GitHub
spark流数据处理，可以从flume-ng，kafka接收数据
☆11Sep 16, 2015Updated 10 years ago
odnoklassniki / spark-to-clickhouse-sink
View on GitHub
☆18Oct 11, 2021Updated 4 years ago
the-pavels / train-station
View on GitHub
Demo application built on top of Apache Pulsar
☆18Feb 8, 2026Updated 5 months ago
marcus-drake / scala-docker
View on GitHub
Docker client for Scala
☆15Sep 1, 2017Updated 8 years ago
aicodes / idea-plugin
View on GitHub
IntelliJ Idea Plugin for AI.codes
☆22Feb 2, 2017Updated 9 years ago
bchevallereau / alfresco-tesseract
View on GitHub
☆12Apr 8, 2018Updated 8 years ago
singer-io / tap-postgres
View on GitHub
tap-postgres
☆67Sep 3, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dhiraa / blockchain-streaming
View on GitHub
Structured Streaming using Apache Spark on Binance Blockchain Stream
☆16May 2, 2018Updated 8 years ago
OlegIlyenko / graalvm-native-image
View on GitHub
GraalVM native-image as a docker container
☆13Oct 11, 2018Updated 7 years ago
canhtran / jav_idol_analysis
View on GitHub
My analysis on Japan Adult Actresses dataset
☆21Sep 29, 2018Updated 7 years ago
minio / presto-minio
View on GitHub
How to use Presto (with Hive metastore) and MinIO?
☆28Mar 8, 2023Updated 3 years ago
calvinlfer / Kafka-Akka-HTTP-and-Akka-Streams-integration
View on GitHub
An example showing how to integrate Apache Kafka with Akka Streams and Akka HTTP.
☆15Sep 28, 2016Updated 9 years ago
mshtelma / spark-structured-streaming-jdbc-sink
View on GitHub
Spark Structured Streaming JDBC Sink
☆16Apr 26, 2021Updated 5 years ago
ansrivas / spark-structured-streaming
View on GitHub
Spark structured streaming with Kafka data source and writing to Cassandra
☆62Dec 5, 2019Updated 6 years ago
pranab / whakapai
View on GitHub
Various Python Data Science Projects available in PyPi
☆28Aug 21, 2024Updated last year
michaelosthege / fairflow
View on GitHub
Functional Airflow DAG definitions.
☆38Jul 4, 2017Updated 9 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
nuxeo-archives / nuxeo-signature
View on GitHub
Digital signature addon for signing PDF files
☆10Apr 10, 2019Updated 7 years ago
OfficeDev / Excel-Add-in-JS-CollegeBudgetTracker
View on GitHub
[ARCHIVED] This task pane add-in shows how to create a college budget tracker using the JavaScript APIs in Excel 2016.
☆11Jul 31, 2019Updated 6 years ago
raykuan / ldap-notes
View on GitHub
LDAP/AD
☆11Sep 21, 2018Updated 7 years ago
univalence / spark-tools
View on GitHub
☆46Apr 27, 2020Updated 6 years ago
Gaz492 / cachet-monitor
View on GitHub
Auto update program for Cachet status pages
☆27Apr 9, 2018Updated 8 years ago
adamcharnock / swiftwind
View on GitHub
User-friendly billing for communal households
☆12Jan 6, 2022Updated 4 years ago
hortonworks-spark / spark-hive-streaming-sink
View on GitHub
A sink to save Spark Structured Streaming DataFrame into Hive table
☆23May 7, 2018Updated 8 years ago
mozillazg / my-blog-file
View on GitHub
my blog post source file(markdown )
☆13Oct 5, 2016Updated 9 years ago
ckan / ckanext-report
View on GitHub
CKAN report infrastructure
☆18Mar 2, 2026Updated 4 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
teamclairvoyant / airflow_demo
View on GitHub
Airflow script for incremental data import from Mysql to Hive using Sqoop.
☆18Jun 6, 2018Updated 8 years ago
mozilla / telemetry-streaming
View on GitHub
Spark Streaming ETL jobs for Mozilla Telemetry
☆18Dec 5, 2019Updated 6 years ago
blaze / datafabric
View on GitHub
A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.
☆13Feb 12, 2016Updated 10 years ago
softprops / cappi
View on GitHub
the sweetest sbt plugin your microbenchmarks will ever meet
☆17Mar 2, 2019Updated 7 years ago
thejunglejane / pynigma
View on GitHub
A Python client for the Enigma API.
☆14Dec 7, 2022Updated 3 years ago
jeremie-lesage / alfresco-docker-cloud
View on GitHub
Main Project of the Alfresco Docker Cloud Project.
☆11Sep 22, 2017Updated 8 years ago
allenai / pipeline
View on GitHub
Library for building reproducible data pipelines to support experimentation
☆20Dec 16, 2015Updated 10 years ago