Spark Java examples for all modules, including GraphX
☆19 Dec 8, 2017 Updated 8 years ago
Alternatives and similar repositories for Apace-Spark-Examples
Users that are interested in Apace-Spark-Examples are comparing it to the libraries listed below.
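- Crawlers for CNKI, Wanfang, and the patent office ☆11 Mar 20, 2019 Updated 7 years ago
- Sina Weibo crawler based on WebCollector, plus related login tools such as Sina Weibo cookie retrieval ☆14 Nov 21, 2018 Updated 7 years ago
- GraphX example ☆24 Jan 23, 2016 Updated 10 years ago
- ☆34 Apr 9, 2015 Updated 10 years ago
- Java source code for a multithreaded Baidu Baike crawler; data is stored in Oracle 11g ☆13 Feb 23, 2017 Updated 9 years ago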
- Common promotional-campaign algorithms for backend development: prize wheel, card flip, scratch card, red-envelope grab, shuffling, and so on ... ☆13 Dec 27, 2019 Updated 6 years ago
- A complete custom processor project, for your reference. ☆17 Sep 29, 2015 Updated 10 years ago
- ☆13 Jul 6, 2023 Updated 2 years ago
- Flume Sink for writing events directly to Parquet files in HDFS ☆18 May 19, 2017 Updated 8 years ago
- Source-code analysis of Spark GraphX principles and related operations ☆215 Dec 11, 2016 Updated 9 years ago
- ☆11 Apr 12, 2024 Updated last year
- CS224N 2019 Homeworks ☆17 Feb 22, 2019 Updated 7 years ago
- A statistical analysis and data mining project that combines distributed crawling, distributed storage, and distributed computation for statistical analysis ☆14 Feb 6, 2018 Updated 8 years ago
- Book <Spark GraphX In Action> code and resources. ☆26 May 1, 2017 Updated 8 years ago
- Crawlers for Sina Weibo, WeChat, Zhihu, and Toutiao; supports Sina login with CAPTCHA solving to obtain cookies for login ☆16 Jul 3, 2017 Updated 8 years ago
- A demo repo for using keylines with IBM Graph ☆11 Dec 20, 2016 Updated 9 years ago
- A Transformer implementation following The Annotated Transformer, with detailed explanations added; suitable for beginners ☆11 Feb 2, 2020 Updated 6 years ago
- 1. Web crawling 2. Multithreading and thread pools 3. Full-text search 4. Hadoop distributed platform, HDFS/MapReduce, Zookeeper, HBase 5. Redis distributed cache 6. Integrated WeChat official account development 7. Spring4 new features 8. ActiveMQ 9. Detailed Nginx configuration… ☆16 Nov 16, 2022 Updated 3 years ago
- ☆14 Feb 2, 2023 Updated 3 years ago
- Official repository for PraPR source code ☆14 May 11, 2021 Updated 4 years ago
- Crisis Event Extraction Service (CREES) ☆15 Feb 4, 2019 Updated 7 years ago
- ☆19 May 13, 2021 Updated 4 years ago
- Java crawler that uses HtmlUtil and jsoup to scrape Chinese patent data from CNKI ☆15 Mar 21, 2019 Updated 7 years ago
- Library for testing Neo4j code over REST ☆13 Nov 14, 2020 Updated 5 years ago
- ☆20 Jun 29, 2024 Updated last year
- ☆18 Nov 19, 2017 Updated 8 years ago
- Python script for importing DBpedia nodes and relationships into Neo4j ☆14 Mar 15, 2014 Updated 12 years ago
- Official implementation of the paper 'Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases' ☆14 Oct 4, 2023 Updated 2 years ago
- JD.com product recommendation system: data crawler ☆18 Apr 9, 2015 Updated 10 years ago
- Attempt to understand Percy Liang's Dependency-based Compositional Semantics by implementing it in Python ☆10 Mar 10, 2013 Updated 13 years ago
- Coolplay Spark: Spark source code analysis, Spark libraries, and more ☆13 Dec 6, 2017 Updated 8 years ago
- ☆14 Aug 20, 2024 Updated last year
- Spark Streaming reads messages from Kafka, writes offsets to Redis, computes word frequencies with Spark, and finally writes the results to a Hive table ☆17 Jul 30, 2019 Updated 6 years ago
- Sohu news crawler written in Java ☆14 May 2, 2017 Updated 8 years ago
- Structure-Invariant Testing for Machine Translation [ICSE'20] ☆16 Dec 17, 2020 Updated 5 years ago
- ☆16 Apr 26, 2021 Updated 4 years ago
- Java crawler with anti-crawler strategies, ETL data cleaning, and Spark offline and real-time news analysis, with results stored in ES ☆19 Nov 26, 2018 Updated 7 years ago
- A partial runnable code repo for annotated-transformer ☆20 Nov 14, 2019 Updated 6 years ago
- The RunBugRun dataset of executable bugs ☆24 Sep 24, 2025 Updated 6 months ago