☆70Mar 15, 2021Updated 5 years ago
Alternatives and similar repositories for Data-Science-Extensions
Users that are interested in Data-Science-Extensions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A JDBC streaming source for Spark☆10Feb 19, 2024Updated 2 years ago
- ☆26Apr 15, 2021Updated 5 years ago
- Fit Lasso model to binary rules created from tree ensembles☆12Aug 2, 2017Updated 8 years ago
- Spark app to merge different schemas☆23Dec 21, 2020Updated 5 years ago
- Partial resurrection of the Rcompression package since memCompress/memDecompress are brain dead☆11May 20, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Mar 21, 2016Updated 10 years ago
- A custom extractor designed to read parquet for Azure Data Lake Analytics☆13Feb 13, 2018Updated 8 years ago
- spark structured streaming via HTTP communication☆18Jul 7, 2022Updated 3 years ago
- Using JPMML Evaluator to validate the PMML models exported from Spark☆19May 1, 2017Updated 9 years ago
- notebooks for nlp-on-spark☆13Jan 27, 2017Updated 9 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆17Jan 12, 2017Updated 9 years ago
- Nano product template – clone this to start a new project!☆30Nov 18, 2015Updated 10 years ago
- Transparent at-rest AES encryption for Firebase.☆16Jun 12, 2026Updated 2 weeks ago
- Some examples on how to use Apache Calcite☆12May 16, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Nested array transformation helper extensions for Apache Spark☆37Aug 4, 2023Updated 2 years ago
- Serverless Apache Spark On AWS Fargate☆17Jun 1, 2019Updated 7 years ago
- Basic Spark utilities☆13Feb 20, 2025Updated last year
- Algebird's HyperLogLog support for Apache Spark.☆10Jul 20, 2017Updated 8 years ago
- This sample demonstrates how to make a use of modules provided by Microsoft Azure File Service in Python.☆11Apr 21, 2021Updated 5 years ago
- How to execute a REST API call in Apache Spark the right way, using Scala☆19Oct 11, 2022Updated 3 years ago
- Generate mock data based on an Apache Avro schema and specific cardinality settings☆10Apr 16, 2018Updated 8 years ago
- A python bot framework for slack☆21Mar 20, 2024Updated 2 years ago
- Jrebel破解服务☆13Oct 28, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Sample custom Nifi processor to process tcpdump☆18Nov 19, 2015Updated 10 years ago
- Schema and type system for creating sortable byte[]☆47Jan 30, 2013Updated 13 years ago
- Basketball Statistics Demo☆11Oct 18, 2016Updated 9 years ago
- A K8s-based infrastructure for analytics☆24Jan 15, 2020Updated 6 years ago
- Single view demo☆14Feb 13, 2016Updated 10 years ago
- The dbt-spark-livy adapter allows you to use dbt along with Apache Spark, by connecting via Apache Livy☆12Mar 30, 2023Updated 3 years ago
- Spark package for checking data quality☆220Feb 28, 2020Updated 6 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆24Sep 25, 2014Updated 11 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆77Feb 15, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tool for visualizing Apache Oozie pipelines☆13Feb 15, 2016Updated 10 years ago
- Codec for Hadoop adding OpenPGP encryption using Bouncy Castle☆17Aug 18, 2011Updated 14 years ago
- Demonstrates calling a Scala UDF from Python using spark-submit with an EGG and JAR☆23Mar 3, 2020Updated 6 years ago
- A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)☆12May 2, 2021Updated 5 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Jul 7, 2021Updated 4 years ago
- Spark on Kubernetes samples☆20Jun 8, 2021Updated 5 years ago
- An implementation of GloVe model for learning word representations for big text corpuses distributed with Apache Spark.☆15Feb 25, 2018Updated 8 years ago