michal-harish / kafka-hadoop-loader
Hadoop job for schemaless, incremental loading of messages from Kafka topics onto HDFS, with configurable output partitioning.
☆90 · Oct 2, 2016 · Updated 9 years ago
Alternatives and similar repositories for kafka-hadoop-loader
Users who are interested in kafka-hadoop-loader are comparing it to the libraries listed below.
- Java client to connect directly to Impala using Thrift ☆33 · Apr 12, 2017 · Updated 8 years ago
- Library of different Bloom filters in Java with optional Redis-backing, counting and many hashing options. ☆20 · Aug 21, 2022 · Updated 3 years ago
- Capture the logical plan from Spark (SQL) ☆22 · Mar 6, 2021 · Updated 4 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream … ☆22 · Feb 6, 2017 · Updated 9 years ago
- THIS REPOSITORY IS DEPRECATED ☆19 · Jul 6, 2023 · Updated 2 years ago
- A table schema-less OLAP Analytics Engine for Big Data. ☆24 · Apr 23, 2024 · Updated last year
- LinkedIn's previous generation Kafka to HDFS pipeline. ☆883 · Aug 27, 2020 · Updated 5 years ago
- A netty-spring-based web controller framework. ☆17 · Feb 16, 2016 · Updated 10 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,260 · Jan 15, 2026 · Updated last month
- Presto connector for Apache Kudu ☆48 · Mar 22, 2019 · Updated 6 years ago
- assembly-examples ☆26 · Jun 16, 2024 · Updated last year
- flumeng-kafka-plugin ☆77 · Aug 31, 2015 · Updated 10 years ago
- A Spark SQL HBase connector ☆29 · May 4, 2015 · Updated 10 years ago
- Parse Redis dump.rdb file ☆31 · Aug 30, 2016 · Updated 9 years ago
- Skeleton for writing Spark programs in Scala with spring-boot ☆28 · Oct 22, 2019 · Updated 6 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 · Sep 8, 2022 · Updated 3 years ago
- Send mongo oplog stream to Kafka ☆29 · Sep 1, 2015 · Updated 10 years ago
- Apache Spark based ETL Engine ☆71 · Oct 18, 2016 · Updated 9 years ago
- Connect Spark to HBase for reading and writing data with ease ☆295 · Dec 19, 2017 · Updated 8 years ago
- LASER-A Scalable Response Prediction Platform For Online Advertising ☆48 · Sep 23, 2014 · Updated 11 years ago
- A Windows information stealer / credential stealer written in Go for security research and malware analysis. Demonstrates browser passwo… ☆26 · Dec 15, 2025 · Updated 2 months ago
- The code in this repository extracts the shellcode from a maldoc. ☆10 · Jul 17, 2023 · Updated 2 years ago
- Integration of Iceberg table management into Spark SQL ☆11 · Jan 21, 2020 · Updated 6 years ago
- ☆11 · Aug 14, 2014 · Updated 11 years ago
- Examples made while playing around with Hadoop. ☆10 · Feb 15, 2017 · Updated 9 years ago
- ☆10 · May 8, 2018 · Updated 7 years ago
- A timer module for Redis ☆11 · Oct 16, 2019 · Updated 6 years ago
- Interplanetary Database: A Database built on top of IPFS and made immutable using Ethereum blockchain. ☆10 · Sep 19, 2022 · Updated 3 years ago
- Apache Spark Web Monitor Tool, varOne ☆36 · Aug 26, 2016 · Updated 9 years ago
- ☆10 · Aug 13, 2021 · Updated 4 years ago
- Uninstall Pinxixi (拼夕夕, a nickname for Pinduoduo) ☆12 · Jan 12, 2021 · Updated 5 years ago
- Crypto trader using signals or automation ☆16 · Jan 25, 2018 · Updated 8 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark ☆35 · Jul 9, 2025 · Updated 7 months ago
- ☆11 · Dec 10, 2015 · Updated 10 years ago
- ☆20 · Jun 29, 2022 · Updated 3 years ago
- Scripts for running Apache Kafka on Mesosphere's Marathon ☆14 · Dec 6, 2015 · Updated 10 years ago
- An Apache Spark-shell backend for IPython ☆105 · Jul 2, 2021 · Updated 4 years ago
- Open-source distributed workflow scheduling tool that also supports streaming tasks. ☆39 · Nov 11, 2017 · Updated 8 years ago
- Java-based wrapper around the low-level HBase API that provides annotation-based ORM support: define entity classes and you can perform all kinds of HBase operations. Complex data types such as List, Set, and Map are also supported. ☆42 · Dec 5, 2016 · Updated 9 years ago