sourav-mazumder/Data-Science-Extensions

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sourav-mazumder/Data-Science-Extensions)

sourav-mazumder / Data-Science-Extensions

☆70

Alternatives and similar repositories for Data-Science-Extensions

Users that are interested in Data-Science-Extensions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AbsaOSS / Jdbc2S
View on GitHub
A JDBC streaming source for Spark
☆10Feb 19, 2024Updated 2 years ago
sutugin / spark-streaming-jdbc-source
View on GitHub
☆26Apr 15, 2021Updated 5 years ago
gravesee / rulefit
View on GitHub
Fit Lasso model to binary rules created from tree ensembles
☆12Aug 2, 2017Updated 8 years ago
cordon-thiago / spark-schema-merge
View on GitHub
Spark app to merge different schemas
☆23Dec 21, 2020Updated 5 years ago
hrbrmstr / gzmem
View on GitHub
Partial resurrection of the Rcompression package since memCompress/memDecompress are brain dead
☆11May 20, 2018Updated 8 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
bluejoe2008 / spark-http-stream
View on GitHub
spark structured streaming via HTTP communication
☆18Jul 7, 2022Updated 4 years ago
elastacloud / parquet-usql
View on GitHub
A custom extractor designed to read parquet for Azure Data Lake Analytics
☆13Feb 13, 2018Updated 8 years ago
selvinsource / spark-pmml-exporter-validator
View on GitHub
Using JPMML Evaluator to validate the PMML models exported from Spark
☆19May 1, 2017Updated 9 years ago
bernhard-42 / pyspark-atlas
View on GitHub
PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection
☆17Jan 12, 2017Updated 9 years ago
joomcode / spark-platform
View on GitHub
Basic Spark utilities
☆13Feb 20, 2025Updated last year
bolcom / avro-schema-viewer
View on GitHub
Visualizer for Avro Schemas (.avsc) - Try it yourself at:
☆33Apr 18, 2023Updated 3 years ago
moertel / sQucumber-redshift
View on GitHub
Cucumber-based framework for defining and executing SQL unit, integration and acceptance tests (for AWS Redshift, PostgreSQL)
☆13Sep 30, 2020Updated 5 years ago
SparkMonitor / varOne
View on GitHub
Apache Spark Web Monitor Tool, varOne
☆36Aug 26, 2016Updated 9 years ago
Reviewable / firecrypt
View on GitHub
Transparent at-rest AES encryption for Firebase.
☆16Updated this week
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
coddingtonbear / barbari
View on GitHub
Automates Flatcam generation of G-code for my (and maybe your) PCB milling process
☆10Jun 16, 2023Updated 3 years ago
pinterest / slackminion
View on GitHub
A python bot framework for slack
☆21Mar 20, 2024Updated 2 years ago
SpectatorWjx / JrebelLicenseServer
View on GitHub
Jrebel破解服务
☆13Oct 28, 2020Updated 5 years ago
pixipanda / FraudDetection
View on GitHub
Real-time Credit card Fraud detection using Spark Streaming, Spark ML, Spark SQL, Kafka, Cassandra and Airflow
☆11Jul 1, 2022Updated 4 years ago
trieu / netty-cookbook
View on GitHub
Supporting material (code, libs etc) for my Netty Cookbook
☆29Aug 8, 2023Updated 2 years ago
data-mill-cloud / data-mill
View on GitHub
A K8s-based infrastructure for analytics
☆24Jan 15, 2020Updated 6 years ago
ContainerSolutions / mesos-hello-world
View on GitHub
Very simple hello world mesos framework to demonstrate mini-mesos
☆12Jan 11, 2016Updated 10 years ago
jordanvolz / BasketballStats
View on GitHub
Basketball Statistics Demo
☆11Oct 18, 2016Updated 9 years ago
cloudera / dbt-spark-livy
View on GitHub
The dbt-spark-livy adapter allows you to use dbt along with Apache Spark, by connecting via Apache Livy
☆12Mar 30, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
FRosner / drunken-data-quality
View on GitHub
Spark package for checking data quality
☆220Feb 28, 2020Updated 6 years ago
celestinhermez / sparkify_customer_churn
View on GitHub
Modeling customer churn with Spark
☆12Jan 24, 2019Updated 7 years ago
openaire / vipe
View on GitHub
Tool for visualizing Apache Oozie pipelines
☆13Feb 15, 2016Updated 10 years ago
amesar / spark-python-scala-udf
View on GitHub
Demonstrates calling a Scala UDF from Python using spark-submit with an EGG and JAR
☆23Mar 3, 2020Updated 6 years ago
Pathairush / airflow_hive_spark_sqoop
View on GitHub
A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)
☆12May 2, 2021Updated 5 years ago
arun-gupta / docker-java-ides
View on GitHub
☆10Oct 16, 2025Updated 9 months ago
sdcoca / facex
View on GitHub
Geometrical Face Features Extraction
☆16Mar 30, 2013Updated 13 years ago
paulmw / hive-udf
View on GitHub
☆16Apr 17, 2014Updated 12 years ago
DuchessFrance / spark-in-practice-scala
View on GitHub
Play with the Spark, Spark streaming and DataFrame API.
☆12Jun 26, 2015Updated 11 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
mashin-io / oep
View on GitHub
The public repo for Oozie Editor Plugin.
☆16Feb 11, 2021Updated 5 years ago
implydata / druid-datagenerator
View on GitHub
A data generator for Apache Druid
☆12Mar 26, 2025Updated last year
ggear / cloudera-framework
View on GitHub
☆11Feb 14, 2020Updated 6 years ago
ZoinerTejada / mastering-azure-analytics
View on GitHub
Repository for code samples from the book Mastering Azure Analytics
☆25Apr 10, 2017Updated 9 years ago
empathyco / platform-spark-kubernetes-samples
View on GitHub
Spark on Kubernetes samples
☆20Jun 8, 2021Updated 5 years ago
saurfang / spark-sas7bdat
View on GitHub
Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL
☆99Jan 22, 2026Updated 6 months ago
suhangpro / dpm-face
View on GitHub
Face detection with alignment from unconstrained photos
☆12Sep 29, 2015Updated 10 years ago