Big Data Processing Framework - Unified Data API or SQL on Any Storage
☆252Jul 10, 2025Updated 10 months ago
Alternatives and similar repositories for gimel
Users that are interested in gimel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Example Postman collections for PayPal APIs☆23Apr 8, 2026Updated last month
- ☆31May 18, 2026Updated last week
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Dec 15, 2024Updated last year
- The Hyperwallet Mirakl Connector (HMC) is a self-hosted solution that mediates between a Mirakl marketplace solution and the Hyperwallet …☆41Nov 26, 2025Updated 6 months ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30May 13, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Extensible streaming ingestion pipeline on top of Apache Spark☆46Jul 17, 2025Updated 10 months ago
- Demo app for paypal-checkout☆53Nov 23, 2023Updated 2 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆97Nov 14, 2019Updated 6 years ago
- Qubole Sparklens tool for performance tuning Apache Spark☆589Jun 26, 2024Updated last year
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35May 6, 2026Updated 3 weeks ago
- Iceberg is a table format for large, slow-moving tabular data☆493Apr 10, 2023Updated 3 years ago
- Generic Data Ingestion & Dispersal Library for Hadoop☆482Mar 19, 2023Updated 3 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆28May 15, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- something to help you spark☆65Oct 23, 2018Updated 7 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆25Oct 16, 2020Updated 5 years ago
- Tools for rewriting and optimizing DAGs (directed-acyclic graphs) in Scala☆151Mar 20, 2022Updated 4 years ago
- Hortonworks Data Platform Data Generation Tool☆13Nov 30, 2017Updated 8 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,263May 19, 2026Updated last week
- Open Source Secret Provider plugin for the Kafka Connect framework☆47Jul 19, 2024Updated last year
- Mirus is a cross data-center data replication tool for Apache Kafka☆209May 21, 2026Updated last week
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.☆129May 9, 2020Updated 6 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆430Jan 14, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆1,684May 5, 2026Updated 3 weeks ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- Akka Streams & Akka HTTP for Large-Scale Production Deployments☆1,434Apr 17, 2024Updated 2 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆588Jan 24, 2024Updated 2 years ago
- The sane way of building a data layer in Airflow☆24Dec 5, 2019Updated 6 years ago
- JupyterLab extension to provide a Kubeflow specific left area for Notebooks deployment☆17Apr 14, 2020Updated 6 years ago
- Awesome List for Infrastructure as Code☆15Jul 15, 2017Updated 8 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Apr 15, 2025Updated last year
- Big Data Toolkit for the JVM☆148Nov 4, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆306Oct 30, 2025Updated 6 months ago
- ☆64Nov 8, 2019Updated 6 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆135Jan 11, 2024Updated 2 years ago
- Thoughts on things I find interesting.☆17Dec 19, 2024Updated last year
- A Monitor over HBase, including Table,Region,RegionServer,Zookeeper monitoring etc.☆54Dec 20, 2018Updated 7 years ago
- High performance data store solution☆1,446May 15, 2026Updated 2 weeks ago
- Quark is a data virtualization engine over analytic databases.☆101Jul 13, 2017Updated 8 years ago