mraad / spark-dbf
Spark SQL DBF Library
☆16Updated 10 years ago
Alternatives and similar repositories for spark-dbf:
Users that are interested in spark-dbf are comparing it to the libraries listed below
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Mirror of Apache MRQL (Incubating)☆17Updated 7 years ago
- Experiments with the GDELT dataset and Cassandra schemas.☆25Updated 9 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 8 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Cascading on Apache Flink®☆54Updated last year
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Simple Spark app that reads and writes Avro data☆31Updated 9 years ago
- Example how to integrate Esper with Akka in the form of an Akka event bus☆29Updated 10 years ago
- functionstest☆33Updated 8 years ago
- Edit code in IntelliJ, eval/run in Zeppelin notebook☆18Updated 5 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- A prototype native MongoDB connector for Apache Spark, using Spark's external datasource API☆9Updated 9 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- XPath likeness for Avro☆35Updated last year
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- ☆41Updated 7 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)☆13Updated 10 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- something to help you spark☆65Updated 6 years ago
- Bucketing and partitioning system for Parquet☆30Updated 6 years ago
- Analyzing Twitter real time feed with Spark Streaming☆32Updated 10 years ago
- Akka persistance plugin implementation with Apache Ignite☆21Updated 6 years ago
- Import and export TensorFlow records from/to Spark☆17Updated 7 years ago