Rokku project. This project acts as a proxy on top of any S3 storage solution, providing services like authentication, authorization, short-term tokens, and lineage.
☆70 · Aug 27, 2025 · Updated 6 months ago
Alternatives and similar repositories for rokku
Users who are interested in rokku are comparing it to the libraries listed below.
- Apache Ranger Plugin for S3 ☆20 · Nov 30, 2022 · Updated 3 years ago
- Dedicated Kafka Connector to track changes in MLflow Model Registry ☆10 · Jan 8, 2021 · Updated 5 years ago
- Adaptive File Source Connector for Spark, optimised for reading from object stores ☆15 · Oct 18, 2022 · Updated 3 years ago
- Hadoop InputFormat for http://druid.io/ ☆10 · Oct 26, 2016 · Updated 9 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink ☆14 · Aug 17, 2015 · Updated 10 years ago
- JupyterLab Notebook for Mesosphere DC/OS ☆11 · Aug 6, 2019 · Updated 6 years ago
- Ultra-high-performance local IPC framework with Zipkin tracing to conduct a beautiful symphony of (brotherhood) build tooling. ☆10 · Jan 8, 2021 · Updated 5 years ago
- Code that was used as an example during the Data+AI Summit 2020 ☆15 · Mar 8, 2021 · Updated 5 years ago
- The Reactive Geospatial Server ☆20 · Feb 24, 2021 · Updated 5 years ago
- Airflow declarative DAGs via YAML ☆133 · Sep 18, 2023 · Updated 2 years ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion. ☆14 · Apr 12, 2022 · Updated 3 years ago
- ☆41 · May 16, 2023 · Updated 2 years ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store ☆17 · Oct 20, 2022 · Updated 3 years ago
- dbt adapter for Athena ☆38 · May 28, 2024 · Updated last year
- Realtime feedback of Akka Stream processing via WebSockets ☆16 · Dec 9, 2019 · Updated 6 years ago
- ☆67 · Apr 14, 2017 · Updated 8 years ago
- ☆21 · Mar 17, 2023 · Updated 2 years ago
- Example implementation running Airflow as separate services with docker-compose. ☆19 · Nov 5, 2018 · Updated 7 years ago
- Paper: A Zero-rename committer for object stores ☆20 · Nov 7, 2025 · Updated 4 months ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️ ☆18 · Feb 23, 2026 · Updated 2 weeks ago
- Docker images for Camunda BPM Enterprise Edition ☆21 · Mar 11, 2020 · Updated 5 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos ☆19 · Sep 8, 2016 · Updated 9 years ago
- Apache Atlas development image for the Rokku project: https://github.com/ing-bank/rokku ☆22 · Jun 9, 2020 · Updated 5 years ago
- Big Data Newsletter ☆23 · Apr 12, 2024 · Updated last year
- A library for monitoring Akka that uses Micrometer metrics ☆24 · Mar 24, 2023 · Updated 2 years ago
- Don't Panic. This guide will help you when it feels like the end of the world. ☆30 · Feb 7, 2026 · Updated last month
- Ansible playbooks for Apache Spark on kube ☆27 · Jul 20, 2017 · Updated 8 years ago
- Scala framework for collecting performance metrics and conducting sound experimental benchmarking. ☆13 · Nov 19, 2025 · Updated 3 months ago
- Open-Source Billing And Rating Platform For Subscription ☆10 · Feb 5, 2024 · Updated 2 years ago
- A load balancer / proxy / gateway for prestodb ☆358 · Jul 25, 2024 · Updated last year
- App-level Chaos Engineering ☆28 · Apr 27, 2021 · Updated 4 years ago
- Python library providing sentiment lexicons. ☆26 · Dec 15, 2016 · Updated 9 years ago
- Simplicity and high performance for managing microservices ☆18 · Feb 25, 2023 · Updated 3 years ago
- Apache Spark OpenCPU Executor (ROSE) ☆26 · Jun 16, 2018 · Updated 7 years ago
- Boilerplate project for MOTW Workshop 2015 ☆10 · Mar 3, 2016 · Updated 10 years ago
- ☆31 · Oct 25, 2018 · Updated 7 years ago
- WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging … ☆31 · Oct 28, 2025 · Updated 4 months ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark. ☆76 · Apr 24, 2024 · Updated last year
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 · Dec 14, 2022 · Updated 3 years ago