CODAIT/stocator

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CODAIT/stocator)

CODAIT / stocator

Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.

☆115

Alternatives and similar repositories for stocator

Users that are interested in stocator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ibm-watson-data-lab / ibmos2spark
View on GitHub
Facilitates Data I/O between Spark and IBM Object Storage services.
☆10Feb 26, 2019Updated 7 years ago
CODAIT / spark-db2
View on GitHub
DB2/DashDB Connector for Apache Spark
☆14Jul 30, 2021Updated 4 years ago
IBM / dsx-twitter-auto-analysis
View on GitHub
WARNING: This repository is no longer maintained The Insights for Twitter service from IBM Cloud has been sunset. This repository will n…
☆11Apr 10, 2019Updated 7 years ago
IBM-Cloud / ibm-cloud-cli-sdk
View on GitHub
☆35Updated this week
apache / couchdb-chttpd
View on GitHub
Mirror of Apache CouchDB
☆16Dec 10, 2018Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
apache / incubator-crail
View on GitHub
Mirror of Apache crail (Incubating)
☆152Jul 3, 2022Updated 4 years ago
CODAIT / redrock
View on GitHub
RedRock - Mobile Application prototype using Apache Spark, Twitter and Elasticsearch
☆15Sep 10, 2018Updated 7 years ago
zrlio / crail
View on GitHub
[Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O
☆75Mar 2, 2018Updated 8 years ago
IBM-Cloud / BigInsights-on-Apache-Hadoop
View on GitHub
Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix
☆23Sep 27, 2017Updated 8 years ago
adamj9431 / notebook_xterm
View on GitHub
A terminal emulator embedded in a IPython/Jupyter notebook.
☆27Feb 3, 2022Updated 4 years ago
zrlio / crail-spark-io
View on GitHub
Fast I/O plugins for Spark
☆42Dec 14, 2020Updated 5 years ago
steveloughran / formality
View on GitHub
Formal Methods, Maths and papers
☆23Dec 10, 2025Updated 7 months ago
apache / bahir
View on GitHub
Mirror of Apache Bahir
☆336Jul 7, 2023Updated 3 years ago
qubole / rubix
View on GitHub
Cache File System optimized for columnar formats and object stores
☆188Aug 11, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
TrueCar / mleap
View on GitHub
MLeap allows for easily putting Spark ML pipelines into production
☆78Oct 27, 2016Updated 9 years ago
IBM / SVCheck
View on GitHub
SVCheck - Spectrum Virtualize Checker
☆10Sep 23, 2021Updated 4 years ago
jeoffreylim / maelstrom
View on GitHub
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …
☆21Feb 6, 2017Updated 9 years ago
ibm-research-ireland / sparkoscope
View on GitHub
Enabling Spark Optimization through Cross-stack Monitoring and Visualization
☆47Aug 23, 2017Updated 8 years ago
mirkoprescha / spark-zeppelin-docker
View on GitHub
docker image with spark and zeppelin
☆12May 28, 2019Updated 7 years ago
IBMDataScience / SparkSummitDemo
View on GitHub
PySpark Notebook and Shiny App for Demo
☆34Mar 24, 2017Updated 9 years ago
qubole / spark-state-store
View on GitHub
Rocksdb state storage implementation for Structured Streaming.
☆17Oct 21, 2020Updated 5 years ago
tspannhw / stocks-nifi-kafka
View on GitHub
Stocks -> NiFi -> Kafka -> Profit
☆14Nov 16, 2018Updated 7 years ago
ibm-research / iostash
View on GitHub
Flash cache solution iostash
☆11Jun 23, 2016Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CODAIT / spark-bench
View on GitHub
Benchmark Suite for Apache Spark
☆242Apr 12, 2023Updated 3 years ago
RunningJon / outsourcer
View on GitHub
☆24Feb 4, 2021Updated 5 years ago
bomeng / Heracles
View on GitHub
High performance HBase / Spark SQL engine
☆28Jul 7, 2022Updated 4 years ago
ryandawsonuk / minions
View on GitHub
Minions for minikube - a demo of kubernetes features
☆15Mar 4, 2018Updated 8 years ago
miguel10 / YARN-Memory-Calculator
View on GitHub
Hadoop YARN & MapReduce Memory Calculator
☆13Nov 9, 2015Updated 10 years ago
brightcove-archive / ooyala_scamr
View on GitHub
A Hadoop map reduce framework for Scala.
☆15Apr 21, 2016Updated 10 years ago
RedisLabs / spark-timeseries
View on GitHub
A library for financial and time series calculations on Apache Spark
☆28Feb 2, 2016Updated 10 years ago
sequenceiq / docker-serf
View on GitHub
Serf on Docker containers
☆34Feb 22, 2015Updated 11 years ago
mitodl / release-script
View on GitHub
Scripts to automate the release process, aka "Doof"
☆16Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ekesken / docker-rabbitmq
View on GitHub
docker image to deploy rabbitmq cluster on mesos with one marathon app
☆10Oct 12, 2017Updated 8 years ago
hpcugent / hanythingondemand
View on GitHub
hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs
☆12Jul 2, 2019Updated 7 years ago
microsoft / vscode-jupyter-hub
View on GitHub
Jupyter Hub Support in VS Code
☆17Jul 13, 2026Updated last week
CODAIT / aardpfark
View on GitHub
A library for exporting Spark ML models and pipelines to PFA
☆55Nov 21, 2018Updated 7 years ago
SyncFree / antidote-fs
View on GitHub
A file system backed by AntidoteDB.
☆13Jun 10, 2021Updated 5 years ago
LPD-EPFL / flodb
View on GitHub
A persistent LSM key-value store. FloDB is designed to scale with the number of threads and memory size.
☆26Mar 28, 2017Updated 9 years ago
CARV-ICS-FORTH / H3
View on GitHub
H3 is an embedded object store in C, Python, and Java
☆12Jul 7, 2021Updated 5 years ago