ActianCorp / spark-vectorLinks
Repository for the Spark-Vector connector
☆20Updated 4 years ago
Alternatives and similar repositories for spark-vector
Users that are interested in spark-vector are comparing it to the libraries listed below
Sorting:
- ☆39Updated 6 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆61Updated last year
- Testbench for experimenting with Apache Hive at any data scale.☆64Updated 8 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50Updated 9 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 6 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 8 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- Ambari stack service for easily installing and managing Solr on HDP cluster☆19Updated 6 years ago
- Helpful user defined fuctions / table generating functions for Hive☆101Updated 9 years ago
- Spark SQL UDF examples☆56Updated 7 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Flink Examples☆39Updated 9 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Updated 7 years ago
- Spark Streaming HBase Example☆22Updated 9 years ago
- Spark cloud integration: tests, cloud committers and more☆19Updated 5 months ago
- Cask Hydrator Plugins Repository☆68Updated this week
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Updated 4 years ago
- Splittable Gzip codec for Hadoop☆71Updated 2 weeks ago
- functionstest☆33Updated 8 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- A tutorial on Apache Spark Unit Testing☆37Updated 9 years ago
- A connector for SingleStore and Spark☆162Updated 3 weeks ago
- Provides a Pythonic interface for reading and writing Avro schemas☆27Updated 2 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆51Updated last month
- A Spark metrics sink that pushes to InfluxDb☆51Updated 4 years ago
- Apache Flink as a Cloudera Manager Service☆12Updated 9 years ago