This project describes how to write a full ETL data pipeline using Spark.
☆15 · Oct 15, 2022 · Updated 3 years ago
Alternatives and similar repositories for spark-data-pipeline
Users who are interested in spark-data-pipeline are comparing it to the libraries listed below.
- Kafka Connect connector for receiving data and writing data to Splunk. ☆25 · Nov 7, 2017 · Updated 8 years ago
- An ETL framework in Scala for Data Engineers ☆23 · Aug 30, 2022 · Updated 3 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table ☆23 · May 7, 2018 · Updated 7 years ago
- Create Kafka-Connect clusters with Docker. You put the Kafka, we put the Connect. ☆25 · Mar 27, 2019 · Updated 6 years ago
- ☆63 · Nov 8, 2019 · Updated 6 years ago
- Context-aware AI dictionary for books, manga & comics. Neural TTS (Piper), IPA generation, PaddleOCR, multi-word lookup. Supports cloud &… ☆19 · Feb 5, 2026 · Updated last month
- Simple Spark example of generating table stats for use of data quality checks ☆28 · Apr 28, 2017 · Updated 8 years ago
- audiofile.cc ☆16 · Jun 27, 2011 · Updated 14 years ago
- ☆12 · Jul 13, 2023 · Updated 2 years ago
- Kafka Sink Connect OrientDB https://www.confluent.io/hub/sanjuthomas/kafka-connect-orientdb ☆10 · Jan 26, 2026 · Updated last month
- ☆35 · Feb 6, 2026 · Updated 3 weeks ago
- Cross-platform toolkit to enhance Claude Code with multi-LLM consensus, 8 specialist agents, semantic knowledge search, and one-command i… ☆31 · Feb 16, 2026 · Updated 2 weeks ago
- [ARCHIVED] CRUD Product List Provider Hosted ASP.NET MVC App ☆13 · Jun 4, 2019 · Updated 6 years ago
- PacketZoom SDK for React Native ☆11 · Sep 21, 2018 · Updated 7 years ago
- This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch. ☆40 · Aug 31, 2016 · Updated 9 years ago
- ☆12 · Mar 15, 2022 · Updated 3 years ago
- An experiment for a Node.JS-based WebDAV server ☆14 · Feb 6, 2011 · Updated 15 years ago
- A set of compound components to make filtering data easier. ☆10 · Oct 3, 2017 · Updated 8 years ago
- Higher order react component for redux-idle-monitor. ☆11 · Jul 7, 2018 · Updated 7 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry ☆11 · Aug 17, 2016 · Updated 9 years ago
- Digital Transformation and Modernization with IBM API Connect, published by Packt ☆12 · Jan 30, 2023 · Updated 3 years ago
- Firebug-like dir() for Node.js. ☆15 · Apr 9, 2011 · Updated 14 years ago
- Example repository for integrating Redux with ReactPWA project. This repo demonstrates the usage & integration of Redux in existing React… ☆10 · Nov 13, 2018 · Updated 7 years ago
- A curated list of awesome Microsoft Azure resources. ☆13 · Apr 16, 2018 · Updated 7 years ago
- Retail Search with AI ☆14 · Feb 14, 2026 · Updated 2 weeks ago
- CIM base development platform backend, built on the RuoYi (若依) framework, BIM+GIS ☆11 · May 25, 2022 · Updated 3 years ago
- Code from the screencasts ☆14 · Jan 13, 2012 · Updated 14 years ago
- A database with automatic dynamic imputation of missing values. ☆11 · Nov 2, 2017 · Updated 8 years ago
- Tool to deploy python virtualenvs ☆13 · Mar 24, 2025 · Updated 11 months ago
- ☆12 · Jun 3, 2019 · Updated 6 years ago
- ☆13 · Aug 22, 2025 · Updated 6 months ago
- .NET Redis container and strongly typed data objects ☆10 · Jul 9, 2024 · Updated last year
- Hands-On Data Science for Marketing, published by Packt ☆10 · Apr 4, 2019 · Updated 6 years ago
- default visualizations that come packaged with the lightning viz notebook ☆12 · Apr 18, 2016 · Updated 9 years ago
- Port of dwmstatus to Rust ☆11 · Oct 18, 2020 · Updated 5 years ago
- ChatGPT backend plugin for Backstage. Handles the interaction with OpenAI and exposes an API for the front end plugin ☆11 · Sep 8, 2023 · Updated 2 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ… ☆14 · Dec 25, 2024 · Updated last year
- A Mini-Project to work and understand flutter animations ☆16 · Jun 11, 2020 · Updated 5 years ago
- Structured Streaming is a reference application showing how to easily integrate structured streaming Apache Spark Structured Streaming, … ☆13 · Nov 17, 2018 · Updated 7 years ago