stdatalabs / inverted-index
An implementation of inverted index in Mapreduce and Spark
☆13Updated 8 years ago
Alternatives and similar repositories for inverted-index:
Users that are interested in inverted-index are comparing it to the libraries listed below
- A Spark Streaming App to analyze the popular hashtags based on keywords☆24Updated 8 years ago
- 基于Spark SQL,可通过输入SQL语句操作HBase表,目前提供对HBase表的查询、创建、删除以及数据插入(需要自己指定rowKey生成规则)的功能,数据删除,分布式导入大规模数据相关功能正在开发中☆12Updated 7 months ago
- Spark1.6和spark2.2的示例,包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe☆15Updated 7 years ago
- Spark SQL UDF examples☆56Updated 7 years ago
- graphx example☆24Updated 9 years ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Updated 7 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆16Updated 6 years ago
- Twitter sentiment analysis using Spark and Stanford CoreNLP and visualization using elasticsearch and kibana☆20Updated 7 years ago
- Simple Flink + Kafka application☆41Updated 8 years ago
- Spark Streaming HBase Example☆22Updated 9 years ago
- Apache Flink Hairless Notes☆13Updated 2 years ago
- An example of Spark and GraphX with Twitter as sample☆19Updated 8 years ago
- Self-contained examples using Apache Spark with the functional features of Java 8☆64Updated 7 years ago
- ☆105Updated 5 years ago
- Example Maven configuration for a Spark, Scala project☆54Updated 3 years ago
- 使用Flink实现用户行为分析☆11Updated 4 years ago
- Showcasing Online analytical processing with Apache Kylin☆8Updated 6 years ago
- Getting started with Spark, Spark streaming, Spark SQL and DataFrame.☆48Updated 6 years ago
- An analysis on Aadhaar dataset using Mapreduce and Spark☆14Updated 7 years ago
- Hive,Pig,Hbase,Sqoop examples☆16Updated 8 years ago
- SparkLearning_NoData, including code,pom and so on☆13Updated 8 years ago
- spark将hdfs数据高性能灌入kafka,然后spark streaming/structured streaming高速消费,关注性能,欢迎提供性能/代码优化建议☆33Updated 6 years ago
- UDF, GenericUDF, UDTF, UDAF☆12Updated 2 years ago
- Examples To Help You Learn Apache Spark☆77Updated 6 years ago
- Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficient…☆55Updated 2 years ago
- Experiments with Apache Flink.☆5Updated last year
- The project implemented some machine learning algorithms on spark which is written in scala and it also included standalone implementatio…☆15Updated 3 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Updated 9 years ago
- ☆13Updated 7 months ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Updated last year