ActianCorp / spark-vector
Repository for the Spark-Vector connector
☆20Updated 3 years ago
Alternatives and similar repositories for spark-vector:
Users that are interested in spark-vector are comparing it to the libraries listed below
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50Updated 8 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 8 years ago
- Spark SQL UDF examples☆56Updated 7 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 6 years ago
- ☆39Updated 6 years ago
- Helpful user defined fuctions / table generating functions for Hive☆101Updated 9 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆61Updated last year
- Will come later...☆20Updated 2 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Code used in "Pro Spark Streaming: The Zen of Real-time Analytics using Apache Spark" published by Apress Publishing.☆48Updated 9 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- Elastic Search on Spark☆112Updated 10 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 8 years ago
- ElasticSearch integration for Apache Spark☆47Updated 9 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆51Updated 10 years ago
- ☆33Updated 9 years ago
- SQL Windowing Functions for Hadoop☆65Updated 2 years ago
- An Apache access log parser written in Scala☆72Updated 4 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)☆22Updated 7 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Spark MLlib code optimized to efficiently support sparse data☆51Updated 8 years ago
- graphx example☆24Updated 9 years ago
- Testbench for experimenting with Apache Hive at any data scale.☆64Updated 7 years ago
- Read SparkSQL parquet file as RDD[Protobuf]☆93Updated 6 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago