spark算子使用例子, spark RDD的算子挺多,有时候如何灵活的使用,该如何用一下子想不起来,这一段时间将spark的算子如何使用的例子给记录了下来,下面是spark RDD 的一些常用算子的使用 这些算子包括有java的,也有scala的语言(博客中才有),由于精力有限,暂时没有python的,以后有空再加上吧
☆35Jul 12, 2018Updated 7 years ago
Alternatives and similar repositories for spark_tutorial
Users that are interested in spark_tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15May 12, 2017Updated 9 years ago
- NSQ as backend for Queue Package☆12May 9, 2026Updated 2 weeks ago
- spark学习中文笔记☆13Mar 26, 2019Updated 7 years ago
- 拓展Mongo Shell,让其支持SQL语句查询,方便开发人员进行简单数据分析和统计。☆11Nov 28, 2017Updated 8 years ago
- 在规格文件上直接执行SQL,无数据库依赖,基于Java8的流计算和Lamdba表达式。☆10Jun 15, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 博客☆16Sep 17, 2025Updated 8 months ago
- 直接在Hadoop执行SQL,不依赖于Hive和Hbase,纯SQL转MapReduce操作。☆11Jul 7, 2016Updated 9 years ago
- 总结了一些Spark学习过程中的例子(附代码详细注释)☆23Aug 19, 2018Updated 7 years ago
- 基于tensorflow搭建的神经网络recursive autuencode,用于实现句子聚类☆12Jul 7, 2017Updated 8 years ago
- An example in Scala of reading data saved in hbase by Spark and an example of converter for python☆25Jul 6, 2018Updated 7 years ago
- 【Released】🛠Java常用的插件API整理以及基于JDK的一些方法封装库,能在不依赖大型框架下快速进行开发(亦可快速用于测试或者脚本类代码编写 - 含数据库相关)。☆13Apr 10, 2025Updated last year
- 中国身份证校验Java版工具,支持15位转18位,地区校验,生日校验,检验码校验,男女识别,☆28Jan 10, 2019Updated 7 years ago
- Parikh et al., A Decomposable Attention Model for Natural Inference☆17Feb 12, 2018Updated 8 years ago
- 简单BI工具,支持Druid,MySQL等多种数据源,方便拓展。可以进行多种形式图标展示和下载。☆16Jan 6, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 基于Spark SQL,可通过输入SQL语句操作HBase表,目前提供对HBase表的查询、创建、删除以及数据插入(需要自己指定rowKey生成规则)的功能,数据删除,分布式导入大规模数据相关功能正在开发中☆13Sep 12, 2024Updated last year
- spark流式计算电商商品关注度+推荐系统/关联系统☆14Dec 12, 2017Updated 8 years ago
- Flink Forward China 2018 第一届记录,视频记录 | 文档记录 | 不仅仅是流计算 | More than streaming☆24Jan 13, 2019Updated 7 years ago
- 基于URule-2.1.5开源版本集成SpringBoot-2.0,基于内置源码的集成,方便直接二次开发。☆13Apr 4, 2018Updated 8 years ago
- ☆28Nov 21, 2013Updated 12 years ago
- Real-Time Analysis Integration with Kafka in Apache Spark’s Structured Streaming☆58Mar 24, 2018Updated 8 years ago
- Official code for the paper: Scaling Transformers for Discriminative Recommendation via Generative Pretraining☆29Sep 1, 2025Updated 8 months ago
- Makes working with OWL ontologies in Java easy by auto-generating classes that wrap OWL instances with a convenient API☆17Jul 20, 2010Updated 15 years ago
- Machine learning and Deep learning notes.☆11Sep 10, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Golang implementation of sliding window rate limiter. Unlike other libraries using 'token bucket', this library supports use case such as…☆19Apr 29, 2021Updated 5 years ago
- ☆42Nov 8, 2025Updated 6 months ago
- Library for testing Neo4j code over REST☆13Nov 14, 2020Updated 5 years ago
- This is a Kaggle data mining contest, link: https://www.kaggle.com/c/avazu-ctr-prediction☆11Mar 12, 2015Updated 11 years ago
- 此工程采用SpringBoot + Mybatis + SparkSQL + Hive框架进行集成,支持Kerberos认证。☆21Mar 19, 2018Updated 8 years ago
- 由于BAAI/bge-large-zh 在Hugging Face Clone不下来,手动下载下来,便于使用☆11Sep 16, 2023Updated 2 years ago
- Python script for importing DBpedia nodes and relationships into Neo4j☆14Mar 15, 2014Updated 12 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Jul 19, 2023Updated 2 years ago
- Multi Attention Network:基于论文《Multiway Attention Networks for Modeling Sentence Pairs》实现的检索型自动评论系统☆17Nov 21, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Simple latex sequence diagram generator. This project uses annotated junit tests to generate latex sequence diagrams.☆45Oct 20, 2016Updated 9 years ago
- elasticsearch-jdbc,在elasticsearch-sql的jdbc实验特性基础上完成,可使用sql和rest api的方式执行elasticsearch操作☆18Mar 8, 2019Updated 7 years ago
- gradle java spi plugin, support scala, link https://github.com/delphyne/gradle-serviceloader-manifest☆10Jan 23, 2025Updated last year
- 利用BERT预训练模型进行文本生成,可用于对话、摘要、问题生成等任务。 目前支持策略,词表的插入和删除、自定 义Character Embedding、随机词替换等☆10Jun 1, 2022Updated 3 years ago
- 杭州第六次 Spark & Flink Meetup☆30May 14, 2018Updated 8 years ago
- Java task scheduler to execute threads which dependency is managed by directed acyclic graph☆26Feb 2, 2017Updated 9 years ago
- Spring Cloud 与 Docker 整合使用示例 ,为《使用Spring Cloud与Docker实战微服务》的配套代码。书籍地址:https://github.com/eacdy/spring-cloud-book 。讨论QQ群:157525002(已满)、5648…☆41Oct 15, 2016Updated 9 years ago