skrusche63/spark-connect

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/skrusche63/spark-connect)

skrusche63 / spark-connect

A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other data sources from Apache Spark.

☆13

Alternatives and similar repositories for spark-connect

Users that are interested in spark-connect are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Mageswaran1989 / awesome-ApacheSpark-collections
View on GitHub
A curated list of awesome Apache Spark packages and resources.
☆40Mar 14, 2017Updated 9 years ago
OopsOutOfMemory / spark-sql-hbase
View on GitHub
A Spark SQL HBase connector
☆29May 4, 2015Updated 11 years ago
matrix2011 / MongodbSync
View on GitHub
mongodb synchronization, mongodb sync
☆13Aug 9, 2017Updated 8 years ago
datawlb / code
View on GitHub
code exercise: dbscan(ballTree improve) | ctr(ftrl) | text classification(bayes..) | kmeans | general LR |..
☆26Jan 21, 2016Updated 10 years ago
alteryx / sparkGLM
View on GitHub
An R-like GLM package for Apache Spark
☆10Aug 6, 2015Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
blockchain / bitcoin-sfox-client
View on GitHub
☆11Nov 8, 2018Updated 7 years ago
taherh / pysimsearch
View on GitHub
Python library for similarity search on text data (such as web pages). Currently intended primarily for pedagogical purposes.
☆14Oct 8, 2011Updated 14 years ago
lyveng / pandas-hbase
View on GitHub
Pandas Helper Library for reading and writing DataFrames from and to HBase.
☆10Mar 8, 2018Updated 8 years ago
trulia / thoth-ml
View on GitHub
☆15Jan 3, 2015Updated 11 years ago
matroid / matroid-python
View on GitHub
Python bindings for Matroid API
☆18Aug 14, 2025Updated 11 months ago
rocky1001 / caravel
View on GitHub
Caravel is a data exploration platform designed to be visual, intuitive, and interactive
☆20Aug 30, 2016Updated 9 years ago
Gschiavon / Kafka-SparkStreaming-HDFS
View on GitHub
☆14Nov 3, 2016Updated 9 years ago
rstudio / expert
View on GitHub
Course materials for Expert Data Wrangling with R. To purchase the videos or watch smaple lessons, visit http://shop.oreilly.com/product/…
☆11Sep 14, 2015Updated 10 years ago
lotze / bandit
View on GitHub
R package for split test/one-armed bandit analysis
☆16May 5, 2014Updated 12 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
networm / progit2-zh
View on GitHub
☆10Jun 7, 2020Updated 6 years ago
imwally / pin
View on GitHub
Simple command line pinboard client.
☆13Feb 3, 2020Updated 6 years ago
topepo / odsc_rules
View on GitHub
Notes and code for the workshop "Rule-Based Models for Regression and Classification”
☆13May 21, 2016Updated 10 years ago
Yuchun-Zhang / R_largeList
View on GitHub
Store, append, read large lists in R without loading whole data into memory.
☆14Apr 18, 2017Updated 9 years ago
sunroyi / SpringCloud
View on GitHub
☆22Jul 22, 2023Updated 3 years ago
tmalaska / SparkStreaming.Sessionization
View on GitHub
NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase
☆50Oct 31, 2014Updated 11 years ago
EdwardsBean / flume-spark-streaming
View on GitHub
conbine flume,spark-streaming and redis for real-time computing
☆22Oct 20, 2014Updated 11 years ago
sinanuozdemir / sfdat22
View on GitHub
SF DAT 22 Course Repository
☆13Jun 3, 2016Updated 10 years ago
ClaudiuCreanga / hands-on-machine-learning-scikit-learn-tensorflow-oreilly-geron
View on GitHub
Book Hands on Machine Learning with Scikit-Learn and Tensorflow from O'reilly - Geron
☆10May 11, 2017Updated 9 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
databricks / spark-package-cmd-tool
View on GitHub
A command line tool for Spark packages
☆19Mar 30, 2023Updated 3 years ago
aredotna / arena-rb
View on GitHub
A Ruby interface to the Arena API.
☆17Oct 13, 2025Updated 9 months ago
cwtree / flume-filemonitor-source
View on GitHub
This source can record the position of file if the flume application has been killed,it also know which line should be read from next tim…
☆19Jan 9, 2017Updated 9 years ago
imevro / ghost-supernova-theme
View on GitHub
Pure design for Ghost.js
☆17Feb 3, 2015Updated 11 years ago
OndraFiedler / spark-recommender
View on GitHub
Scalable recommendation system written in Scala using the Apache Spark framework
☆105Jan 30, 2015Updated 11 years ago
dengleitju / flatq
View on GitHub
轻量级分布式消息队列
☆25Jan 29, 2015Updated 11 years ago
ProjectTw / TwitteR2Mongo
View on GitHub
R Package to stream and analyze tweets using a mongodb
☆13Mar 1, 2016Updated 10 years ago
szilard / dataset-sizes-kdnuggets
View on GitHub
Size of datasets used for analytics based on 10 years of surveys by KDnuggets.
☆16Nov 18, 2015Updated 10 years ago
elijahr / lk
View on GitHub
a programmer's search tool, parallel and fast.
☆16Jun 29, 2012Updated 14 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MangoTheCat / Modelling-Airbnb-Prices
View on GitHub
Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network)
☆10Feb 5, 2019Updated 7 years ago
vighneshbirodkar / pca
View on GitHub
A comparison of various Robust PCA implementations
☆15Apr 19, 2016Updated 10 years ago
opengurukul / oracleplsql-examples
View on GitHub
Oracle PL/SQL Examples
☆11Sep 8, 2012Updated 13 years ago
zoepepper / scalajs-jsjoda
View on GitHub
Scala.js facade for JS-Joda with drop-in to use it as JSR310 implementation.
☆17Jul 5, 2020Updated 6 years ago
CodeRayZhang / Spark-Example
View on GitHub
Spark1.6和spark2.2的示例，包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe
☆15Jan 28, 2018Updated 8 years ago
SHSE / spark-es
View on GitHub
ElasticSearch integration for Apache Spark
☆47Apr 5, 2016Updated 10 years ago
mohamed / roofline
View on GitHub
A simple script to plot the Roofline model for given HW platforms and applications
☆10Mar 17, 2026Updated 4 months ago