mongodb / mongo-spark
The MongoDB Spark Connector
☆708 Updated 3 weeks ago
Related projects:
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,008 Updated last year
- Read - Write JSON SerDe for Apache Hive. ☆733 Updated 9 months ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink. ☆552 Updated 3 years ago
- A connector for Spark that allows reading and writing to/from Redis cluster ☆935 Updated 3 months ago
- CSV Data Source for Apache Spark 1.x ☆1,053 Updated 5 years ago
- Avro Data Source for Apache Spark ☆539 Updated 5 years ago
- REST job server for Apache Spark ☆2,844 Updated 2 months ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆880 Updated this week
- Mirror of Apache Oozie ☆707 Updated 2 months ago
- High Performance Kafka Connector for Spark Streaming. Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.… ☆632 Updated 2 years ago
- Kafka Connect HDFS connector ☆7 Updated last week
- Apache Phoenix ☆1,021 Updated this week
- Connect Spark to HBase for reading and writing data with ease ☆297 Updated 6 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,353 Updated last year
- Scala examples for learning to use Spark ☆444 Updated 4 years ago
- Mirror of Apache Bahir ☆337 Updated last year
- The Internals of Spark Structured Streaming ☆415 Updated last year
- Elasticsearch real-time search and analytics natively integrated with Hadoop ☆1,928 Updated last week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,214 Updated this week
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol) ☆727 Updated last week
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover… ☆516 Updated 4 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces ☆321 Updated 2 years ago
- Examples for High Performance Spark ☆497 Updated 3 weeks ago
- Spark reference applications ☆656 Updated 7 months ago
- Docker build for Apache Spark ☆676 Updated 2 years ago
- LinkedIn's previous generation Kafka to HDFS pipeline. ☆882 Updated 4 years ago
- ☆765 Updated 3 years ago
- Mirror of Apache Sqoop ☆969 Updated 3 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,039 Updated last year
- SparkOnHBase ☆279 Updated 3 years ago