Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.
☆115May 17, 2024Updated 2 years ago
Alternatives and similar repositories for stocator
Users that are interested in stocator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆35Jun 5, 2026Updated last week
- Lithops-based Serverless implementation of the METASPACE spatial metabolomics annotation pipeline☆12Jul 6, 2023Updated 2 years ago
- Fybrik☆132Sep 7, 2025Updated 9 months ago
- Mirror of Apache crail (Incubating)☆151Jul 3, 2022Updated 3 years ago
- Netezza Connector for Apache Spark☆13Sep 10, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- RedRock - Mobile Application prototype using Apache Spark, Twitter and Elasticsearch☆15Sep 10, 2018Updated 7 years ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O☆75Mar 2, 2018Updated 8 years ago
- Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix☆23Sep 27, 2017Updated 8 years ago
- A terminal emulator embedded in a IPython/Jupyter notebook.☆27Feb 3, 2022Updated 4 years ago
- Fast I/O plugins for Spark☆41Dec 14, 2020Updated 5 years ago
- Mirror of Apache Bahir☆337Jul 7, 2023Updated 2 years ago
- Elephant Twin is a framework for creating indexes in Hadoop☆99Oct 12, 2020Updated 5 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago
- Cache File System optimized for columnar formats and object stores☆188Aug 11, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- Mirror of Apache Toree (Incubating)☆750May 30, 2026Updated 2 weeks ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Aug 23, 2017Updated 8 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- riemann tool for cassandra☆31May 19, 2016Updated 10 years ago
- On demand presto cluster with mesos, marathon and docker.☆29Mar 7, 2018Updated 8 years ago
- PySpark Notebook and Shiny App for Demo☆34Mar 24, 2017Updated 9 years ago
- Python Helper library for Jupyter Notebooks☆1,041Feb 16, 2021Updated 5 years ago
- Benchmark Suite for Apache Spark☆242Apr 12, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆24Feb 4, 2021Updated 5 years ago
- Hadoop YARN & MapReduce Memory Calculator☆13Nov 9, 2015Updated 10 years ago
- Flash cache solution iostash☆11Jun 23, 2016Updated 9 years ago
- docker image to deploy rabbitmq cluster on mesos with one marathon app☆10Oct 12, 2017Updated 8 years ago
- Native python client for Infinispan, over the Hot Rod wire protocol☆17Jan 30, 2024Updated 2 years ago
- ☆11May 16, 2022Updated 4 years ago
- A quotation-based Scala DSL for scalable data analysis.☆65Jul 7, 2022Updated 3 years ago
- Jupyter Hub Support in VS Code☆17Updated this week
- A file system backed by AntidoteDB.☆13Jun 10, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A persistent LSM key-value store. FloDB is designed to scale with the number of threads and memory size.☆26Mar 28, 2017Updated 9 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- RGW PubSub API Clients☆14Dec 4, 2019Updated 6 years ago
- Simplifies the way Java developers connect to services in Bluemix.☆18May 7, 2019Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Feb 1, 2016Updated 10 years ago
- Stocks -> NiFi -> Kafka -> Profit☆14Nov 16, 2018Updated 7 years ago