hughhyndman / kdbsparkLinks
Spark Data Source (V2) for Kx Systems kdb+ Database
☆21Updated 5 years ago
Alternatives and similar repositories for kdbspark
Users that are interested in kdbspark are comparing it to the libraries listed below
Sorting:
- kdb+ to Apache Kafka adapter, for pub/sub☆54Updated last year
- ☆107Updated 2 years ago
- Snowflake Data Source for Apache Spark.☆230Updated 2 weeks ago
- an anagram☆136Updated 4 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Palantir Distribution of Apache Spark☆70Updated 2 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆429Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆60Updated 2 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated last month
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆184Updated 2 weeks ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 4 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 2 months ago
- Cache File System optimized for columnar formats and object stores☆184Updated 3 years ago
- Mirror of Apache DataFu☆120Updated 5 months ago
- A library for Spark DataFrame using MinIO Select API☆99Updated 6 years ago
- Components for building stream loaders from Kafka to arbitrary storages☆37Updated this week
- Quark is a data virtualization engine over analytic databases.☆100Updated 8 years ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆45Updated 3 months ago
- A COBOL parser and Mainframe/EBCDIC data source for Apache Spark☆155Updated this week
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 4 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 4 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆65Updated 2 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆122Updated last week
- Iceberg is a table format for large, slow-moving tabular data☆483Updated 2 years ago
- The Internals of Delta Lake☆186Updated 9 months ago
- A simple Spark-powered ETL framework that just works 🍺☆182Updated last month
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Helm Chart for lyft/flinkk8soperator☆11Updated 5 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Updated 8 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago