mraad / spark-dbf
Spark SQL DBF Library
☆16Updated 10 years ago
Alternatives and similar repositories for spark-dbf:
Users that are interested in spark-dbf are comparing it to the libraries listed below
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- something to help you spark☆65Updated 6 years ago
- Mirror of Apache MRQL (Incubating)☆17Updated 7 years ago
- Cascading on Apache Flink®☆54Updated last year
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- XPath likeness for Avro☆35Updated 2 years ago
- Edit code in IntelliJ, eval/run in Zeppelin notebook☆18Updated 6 years ago
- Mirror of Apache Lens☆60Updated 5 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Scala stuff☆18Updated 5 years ago
- functionstest☆33Updated 8 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆61Updated last year
- Experiments with the GDELT dataset and Cassandra schemas.☆25Updated 9 years ago
- Analyzing Twitter real time feed with Spark Streaming☆32Updated 10 years ago
- Example how to integrate Esper with Akka in the form of an Akka event bus☆29Updated 10 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Updated last year
- Bucketing and partitioning system for Parquet☆30Updated 6 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Updated 8 years ago
- Akka persistance plugin implementation with Apache Ignite☆21Updated 6 years ago
- Provides a SQL interface to your TinkerPop enabled graph db☆74Updated last year
- ☆41Updated 7 years ago
- PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)☆13Updated 10 years ago
- Multidimensional data storage with rollups for numerical data☆266Updated last year
- Joins for skewed datasets in Spark☆57Updated 7 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago