A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC databases, and other data sources from Apache Spark.
☆13Feb 23, 2015Updated 11 years ago
Alternatives and similar repositories for spark-connect
Users that are interested in spark-connect are comparing it to the libraries listed below.
- A curated list of awesome Apache Spark packages and resources.☆40Mar 14, 2017Updated 9 years ago
- A Spark SQL HBase connector☆29May 4, 2015Updated 11 years ago
- Python library for similarity search on text data (such as web pages). Currently intended primarily for pedagogical purposes.☆14Oct 8, 2011Updated 14 years ago
- mongodb synchronization, mongodb sync☆13Aug 9, 2017Updated 8 years ago
- Pandas Helper Library for reading and writing DataFrames from and to HBase.☆10Mar 8, 2018Updated 8 years ago
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream…☆31Jun 24, 2016Updated 9 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Aug 21, 2013Updated 12 years ago
- ☆14Nov 3, 2016Updated 9 years ago
- Python bindings for Matroid API☆17Aug 14, 2025Updated 10 months ago
- Django Based Agile project tracking system. Manage projects, tickets, milestones, the whole nine yards.☆19Feb 5, 2015Updated 11 years ago
- Course materials for Expert Data Wrangling with R. To purchase the videos or watch sample lessons, visit http://shop.oreilly.com/product/…☆11Sep 14, 2015Updated 10 years ago
- R package for split test/one-armed bandit analysis☆16May 5, 2014Updated 12 years ago
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- Caravel is a data exploration platform designed to be visual, intuitive, and interactive☆20Aug 30, 2016Updated 9 years ago
- Notes and code for the workshop "Rule-Based Models for Regression and Classification"☆13May 21, 2016Updated 10 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆50Oct 31, 2014Updated 11 years ago
- Combine Flume, Spark Streaming, and Redis for real-time computing☆22Oct 20, 2014Updated 11 years ago
- Scalable recommendation system written in Scala using the Apache Spark framework☆105Jan 30, 2015Updated 11 years ago
- This source records the file position if the Flume application is killed; it also knows which line should be read from next tim…☆19Jan 9, 2017Updated 9 years ago
- The book Hands-On Machine Learning with Scikit-Learn and TensorFlow from O'Reilly (Géron)☆10May 11, 2017Updated 9 years ago
- Example application on how to use mongo-hadoop connector with Spark☆90Feb 18, 2014Updated 12 years ago
- R Package to stream and analyze tweets using a mongodb☆13Mar 1, 2016Updated 10 years ago
- Oracle PL/SQL Examples☆11Sep 8, 2012Updated 13 years ago
- Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network)☆10Feb 5, 2019Updated 7 years ago
- Examples for Spark 1.6 and Spark 2.2, covering Kafka, Flume, Structured Streaming, Jedis, Elasticsearch, MySQL, and DataFrame☆15Jan 28, 2018Updated 8 years ago
- A simple script to plot the Roofline model for given HW platforms and applications☆10Mar 17, 2026Updated 3 months ago
- ElasticSearch integration for Apache Spark☆47Apr 5, 2016Updated 10 years ago
- SF DAT 22 Course Repository☆13Jun 3, 2016Updated 10 years ago
- The R code compares the performance metrics between logistic regression, SVM, Naive Bayes, KNN and random forest classifiers in a 10 fold …☆15Mar 13, 2016Updated 10 years ago
- 🚗 mini self driving car☆18Sep 7, 2016Updated 9 years ago
- Simple Akka HTTP Client DSL for Scala☆11Apr 24, 2017Updated 9 years ago
- ☆11Oct 10, 2014Updated 11 years ago
- A ZooKeeper client library in Scala.☆21Apr 17, 2013Updated 13 years ago
- Raphael, Prototype Analytics Line Chart with Multiple Data☆52Jan 30, 2013Updated 13 years ago
- Using the Slick code generator to generate Slick and Play code☆49May 27, 2016Updated 10 years ago
- Spark, Cassandra, Tessellation and ArcGIS☆10Jan 18, 2015Updated 11 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 8 years ago
- [ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"☆11Apr 26, 2024Updated 2 years ago
- 水花一现 - Spring Boot backend☆19Mar 1, 2018Updated 8 years ago