基于python3使用spark的统计分析,涵盖spark的几大模块,主要有spark core、spark mllib、spark sql及spark streaming等的python实现
☆32Oct 16, 2018Updated 7 years ago
Alternatives and similar repositories for Spark-for-Python
Users that are interested in Spark-for-Python are comparing it to the libraries listed below
Sorting:
- Extension that allows you to receive and publish events via HTTP and https transports☆16Dec 19, 2025Updated 2 months ago
- Extension that can be used to receive events from a Kafka cluster and to publish events to a Kafka cluster☆18Mar 15, 2025Updated 11 months ago
- Spark Streaming examples using python☆15Dec 17, 2015Updated 10 years ago
- A tool for translating Scala source code into readable and maintainable Java code☆13Jan 3, 2026Updated last month
- A batch-processing system base on Spring Boot and Spring Batch. 一个基于SpringBoot和SpringBatch的批处理系统。☆10Sep 10, 2018Updated 7 years ago
- 《2021医学健康数据分析与挖掘》课程论文 -- 基于BERT的20NewsGroups数据集新闻分类实验☆10Jun 22, 2021Updated 4 years ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- zdh系列-基于java的经营风控引擎☆13Jan 24, 2026Updated last month
- 本地解析+存储的Epub电子书阅读器☆10Jul 11, 2023Updated 2 years ago
- 这是居于 derby 源代码,通过删减的方式,从里面抽取出sql解析功能。并在此基础上开发出跨库连接查询器。通过该工具可以将连接查询分割成多个单表查询,再将单表结果集进行连接,即将数据库的连接功能上移到工具执行。详情可以查看wiki:readme☆10Feb 14, 2017Updated 9 years ago
- ☆11Sep 1, 2022Updated 3 years ago
- Creates a Lucene index out of files from a local folder☆13Aug 8, 2014Updated 11 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Mar 15, 2017Updated 8 years ago
- Graphene图数据建模工具| Tool for visually creating a schema for graph database.☆14Apr 12, 2023Updated 2 years ago
- 一个使用 Python 且基于 Flask Web 框架开发的 MVC 架构的个人博客系统。☆13May 21, 2024Updated last year
- Web, Admin & API - TypeScript, React, Next.js GraphQL, Apollo, Express, Docker, Mongo monorepo boilerplate☆10Jan 4, 2023Updated 3 years ago
- 使用shell脚本部署Apache Doris (incubating) FE & BE☆11Jul 8, 2019Updated 6 years ago
- Index what you read for search.☆10Dec 12, 2019Updated 6 years ago
- LightRAG with Neo4j Example Project☆17May 19, 2025Updated 9 months ago
- ☆14Apr 12, 2022Updated 3 years ago
- 📚🔖 A basic PHP-based platform that allows users to share and access important resource links. 🚀 This initiative aims to provide a seam…☆13Aug 25, 2024Updated last year
- ☆10Nov 18, 2020Updated 5 years ago
- Notes for the book Fluent Python, 1st Edition (O'Reilly, 2015)☆11Jun 30, 2022Updated 3 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 7 years ago
- 编译语言实现模式例程☆11Nov 22, 2014Updated 11 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry☆11Aug 17, 2016Updated 9 years ago
- Code Server☆12Jun 28, 2021Updated 4 years ago
- Python自动化办公☆13Nov 5, 2021Updated 4 years ago
- Twitter Spider 推特爬虫,支持搜索关键词采集推文数据,采集相关用户,采集用户主页推文☆10Dec 31, 2019Updated 6 years ago
- Current state of frontend development is controversial. How we got here?☆11Oct 13, 2020Updated 5 years ago
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- ☆14Nov 16, 2022Updated 3 years ago
- chinese word segmentation based on rnn☆13Oct 14, 2016Updated 9 years ago
- Google Guice component management System!☆10Sep 24, 2021Updated 4 years ago
- DSL for make a simple ruby GUI application☆16Nov 6, 2018Updated 7 years ago
- 智能BI平台☆10Apr 20, 2024Updated last year
- ServiceFramework 示例项目☆10Apr 2, 2016Updated 9 years ago
- 用户画像代码,根据算法推算出用户的性别和年龄比率☆11Dec 18, 2017Updated 8 years ago
- Innoshop is an Open Source eCommerce System based on Laravel 11, supporting multiple languages, multiple currencies, integrated with Open…☆16Jan 18, 2026Updated last month