mayur2810/sope

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mayur2810/sope)

mayur2810 / sope

Apache Spark ETL Utilities

☆40

Alternatives and similar repositories for sope

Users that are interested in sope are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AdamPaternostro / Azure-Databricks-Log4J-To-AppInsights
View on GitHub
Connect your Spark Databricks clusters Log4J output to the Application Insights Appender
☆19Aug 4, 2020Updated 5 years ago
AbsaOSS / spark-hofs
View on GitHub
Scala API for Apache Spark SQL high-order functions
☆15Aug 4, 2023Updated 2 years ago
ZuInnoTe / spark-hadoopoffice-ds
View on GitHub
A Spark datasource for the HadoopOffice library
☆36Sep 29, 2025Updated 9 months ago
hammerlab / spark-util
View on GitHub
low-level helpers for Apache Spark libraries and tests
☆16Dec 29, 2018Updated 7 years ago
YotpoLtd / metorikku
View on GitHub
A simplified, lightweight ETL Framework based on Apache Spark
☆588Jan 24, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ververica / lab-sql-vs-datastream
View on GitHub
Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API
☆14Apr 15, 2020Updated 6 years ago
MartijnVisser / flink-only-sql
View on GitHub
Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …
☆12Jul 16, 2026Updated last week
univalence / spark-tools
View on GitHub
☆46Apr 27, 2020Updated 6 years ago
chaohona / redis-proxy
View on GitHub
☆20Dec 30, 2022Updated 3 years ago
techsuppdiva / spark-cheat-sheets
View on GitHub
This repo stores my Spark Tutorial slides.
☆15Feb 8, 2016Updated 10 years ago
drabastomek / learningPySpark_video
View on GitHub
Learning PySpark video series
☆11Mar 5, 2018Updated 8 years ago
Azure-Samples / hdinsight-spark-scala-kafka
View on GitHub
A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight
☆13Mar 2, 2023Updated 3 years ago
sohrab- / microservice-simple-example
View on GitHub
Example microservice for Sixtree blog post
☆14Mar 30, 2016Updated 10 years ago
wtog / web-crawler
View on GitHub
web crawler
☆14Sep 27, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
itinycheng / flink-platform-backend
View on GitHub
Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.
☆22Updated this week
MRobalinho / Object-Detection-Video
View on GitHub
Object Detection Video with TensorFlow
☆14Nov 17, 2018Updated 7 years ago
gvolpe / hll-algorithm-sample
View on GitHub
HLL Algorithm and Web Scraping sample
☆10Sep 29, 2015Updated 10 years ago
FINRAOS / HiveQLUnit
View on GitHub
⚠️ Archived — This repository is no longer maintained and will not receive updates, including security patches. It is preserved in read-o…
☆41Updated this week
sunilsala88 / fyers-files-feb-2024
View on GitHub
☆12Mar 27, 2024Updated 2 years ago
hadooparchitecturebook / clickstream-tutorial
View on GitHub
Code for Tutorial on designing clickstream analytics application using Hadoop
☆54May 20, 2015Updated 11 years ago
big-data-lab-team / accident-prediction-montreal
View on GitHub
☆12Dec 8, 2022Updated 3 years ago
entechlog / snowflake-examples
View on GitHub
Examples and Quick Starts for Snowflake
☆11Jun 18, 2026Updated last month
aws-samples / amazon-eks-apache-spark-etl-sample
View on GitHub
Spark ETL example processing New York taxi rides public dataset on EKS
☆45Jan 5, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
openzipkin-contrib / brave-akka
View on GitHub
Zipkin tracing instrumentation for Akka
☆10Apr 14, 2026Updated 3 months ago
databricks-solutions / databricks-apps-examples
View on GitHub
Features Databricks Apps examples that are built by Databricks field personnel. Meant to act as points-of-inspiration and points-of-imple…
☆25Jan 19, 2026Updated 6 months ago
rucek / akka-streams-in-practice-java8
View on GitHub
Import data from CSV files to Cassandra using Akka Streams with Java 8
☆21May 19, 2017Updated 9 years ago
AWSCookbook / BigData
View on GitHub
Chapter 7 of the AWS Cookbook
☆12Mar 23, 2022Updated 4 years ago
bankyadam / not-so-bigquery
View on GitHub
An emulator for the Google BigQuery, that can be run locally, backed by PostgreSQL.
☆25Feb 18, 2023Updated 3 years ago
cloudera-labs / envelope
View on GitHub
Build configuration-driven ETL pipelines on Apache Spark
☆162Oct 4, 2022Updated 3 years ago
dyatlov / Expedia-PHP-API
View on GitHub
PHP Wrapper for Expedia API
☆21Mar 6, 2014Updated 12 years ago
smart-data-lake / smart-data-lake
View on GitHub
Smart Automation Tool for building modern Data Lakes and Data Pipelines
☆129Jul 20, 2026Updated last week
NickAkincilar / Snowflake_SelfService_Sandbox_Config
View on GitHub
☆13Feb 16, 2022Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
notaryio / notary
View on GitHub
A contracts broker that provides a declarative way of sharing, validating & discovering contracts between multiple projects.
☆17Jan 3, 2018Updated 8 years ago
sabeelkhan99 / Masterclass
View on GitHub
☆16May 19, 2026Updated 2 months ago
metarank / cfor
View on GitHub
Scala cfor macro, like a java for-loop
☆18Aug 12, 2024Updated last year
CrazyCompiler / advance-update
View on GitHub
Advance Update in a Elasticsearch plugin that provides you control over the document update functionality of elasticsearch.
☆18Feb 11, 2018Updated 8 years ago
hammerlab / spark-tests
View on GitHub
Utilities for writing tests that use Apache Spark.
☆24Dec 29, 2018Updated 7 years ago
kokosing / trino-rest
View on GitHub
☆15Nov 13, 2025Updated 8 months ago
databricks-solutions / caspers
View on GitHub
A fully simulated business demo built on Databricks — streaming data, AI agents, and apps you can deploy, explore, and extend.
☆66Updated this week