Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.
☆115May 17, 2024Updated 2 years ago
Alternatives and similar repositories for stocator
Users that are interested in stocator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Facilitates Data I/O between Spark and IBM Object Storage services.☆10Feb 26, 2019Updated 7 years ago
- DB2/DashDB Connector for Apache Spark☆14Jul 30, 2021Updated 4 years ago
- WARNING: This repository is no longer maintained The Insights for Twitter service from IBM Cloud has been sunset. This repository will n…☆11Apr 10, 2019Updated 7 years ago
- Lithops application examples☆11Dec 19, 2024Updated last year
- ☆35May 18, 2026Updated last week
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Fybrik☆132Sep 7, 2025Updated 8 months ago
- Mirror of Apache crail (Incubating)☆151Jul 3, 2022Updated 3 years ago
- Netezza Connector for Apache Spark☆13Sep 10, 2018Updated 7 years ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O☆75Mar 2, 2018Updated 8 years ago
- Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix☆23Sep 27, 2017Updated 8 years ago
- A terminal emulator embedded in a IPython/Jupyter notebook.☆27Feb 3, 2022Updated 4 years ago
- Fast I/O plugins for Spark☆41Dec 14, 2020Updated 5 years ago
- A Gateway for connecting application services in different domains, networks, and cloud infrastructures☆23Feb 1, 2026Updated 3 months ago
- Protect communications with adversarial neural cryptography.☆11Oct 31, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago
- Cache File System optimized for columnar formats and object stores☆188Aug 11, 2022Updated 3 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- Mirror of Apache Toree (Incubating)☆750May 15, 2026Updated last week
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- SVCheck - Spectrum Virtualize Checker☆10Sep 23, 2021Updated 4 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Aug 23, 2017Updated 8 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- ☆13Dec 2, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆15Apr 21, 2022Updated 4 years ago
- On demand presto cluster with mesos, marathon and docker.☆29Mar 7, 2018Updated 8 years ago
- A Kafka JMX configuration file☆20Jul 9, 2018Updated 7 years ago
- Python Helper library for Jupyter Notebooks☆1,041Feb 16, 2021Updated 5 years ago
- Benchmark Suite for Apache Spark☆242Apr 12, 2023Updated 3 years ago
- Java API for libaio☆14Jan 10, 2022Updated 4 years ago
- ☆24Feb 4, 2021Updated 5 years ago
- Hadoop YARN & MapReduce Memory Calculator☆13Nov 9, 2015Updated 10 years ago
- Flash cache solution iostash☆11Jun 23, 2016Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- docker image to deploy rabbitmq cluster on mesos with one marathon app☆10Oct 12, 2017Updated 8 years ago
- High performance HBase / Spark SQL engine☆28Jul 7, 2022Updated 3 years ago
- ibm-cos-sdk-go☆18Mar 5, 2026Updated 2 months ago
- ☆11May 16, 2022Updated 4 years ago
- A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel app…☆365May 19, 2026Updated last week
- A quotation-based Scala DSL for scalable data analysis.☆65Jul 7, 2022Updated 3 years ago
- A library for exporting Spark ML models and pipelines to PFA☆55Nov 21, 2018Updated 7 years ago