Hands-on movie analysis project based on SparkSQL
☆39Nov 21, 2020Updated 5 years ago
Alternatives and similar repositories for spark_project_practise
Users that are interested in spark_project_practise are comparing it to the libraries listed below.
- User behavior log analysis system based on Flink☆24Aug 30, 2020Updated 5 years ago
- A Spark data source for reading Microsoft Excel files☆13Jul 1, 2024Updated last year
- Apache Hudi Demo☆21Apr 24, 2025Updated last year
- gdcache is a pure, non-intrusive cache library implemented in Go; you can use it to implement your own cache.☆14Oct 14, 2021Updated 4 years ago
- Example use cases of Spark + Kudu☆15Sep 13, 2017Updated 8 years ago
- ☆12Apr 22, 2021Updated 5 years ago
- An easy-to-use, scalable spark streaming ETL tool and sdk☆13Aug 14, 2017Updated 8 years ago
- SparkLearning_NoData, including code, pom, and so on☆13Mar 21, 2017Updated 9 years ago
- A dev framework for web backend service based on springboot.☆16Feb 3, 2026Updated 4 months ago
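- A repository created while following the 尚硅谷 (Atguigu) hands-on Flink video course, recording the source code and some of the required data files; discussion is welcome☆17Mar 1, 2021Updated 5 years ago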
- Refactored version for https://github.com/shirdrn/document-processor.git☆15Apr 5, 2017Updated 9 years ago
- Hands-on projects with big data frameworks (Hadoop, Spark, Storm, Flink)☆354Oct 4, 2022Updated 3 years ago
- Intelligent BI platform☆12Apr 20, 2024Updated 2 years ago
- An out-of-the-box environment for Hadoop/Spark applications☆38Jan 31, 2023Updated 3 years ago
- ☆14May 20, 2026Updated 3 weeks ago
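- k8s hadoop: quickly set up a hadoop/hbase/hive environment on k8s. An old project for personal use, picked up again to build a practice environment on this basis (with kafka/flink added) for Tencent TBDS training☆21Mar 21, 2021Updated 5 years ago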
- Set of ETL utils for Spark☆15May 4, 2020Updated 6 years ago
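- Source code for Spark MLlib machine learning practice☆10Oct 28, 2016Updated 9 years ago
- ☕ [Java New Features Series] Release as many Java versions as you like, I'm sticking with Java 8. Still, it never hurts to learn a few of these clever show-off tricks, and some of the new syntax might turn out to be genuinely nice.☆20Feb 23, 2021Updated 5 years ago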
- AQI air quality analysis based on Hadoop MapReduce☆13Dec 30, 2023Updated 2 years ago
- This repository contains my MSc dissertation project. It is an implementation of a streaming GMM algorithm in Spark.☆11Aug 25, 2018Updated 7 years ago
- Spring Boot starter for the mirai component of the simbot framework☆12Jan 1, 2022Updated 4 years ago
- A personal blog backend written in Go, with a RESTful API, Travis for continuous integration, and Docker deployment☆12Oct 21, 2017Updated 8 years ago
- How to learn springboot? Come to learnSpringboot: articles that start from the details and teach step by step, with hands-on examples of various Springboot integrations.☆10Jun 22, 2025Updated 11 months ago
- Influence Maximization in Near-Linear Time: A Martingale Approach Scala implementation☆14Sep 3, 2018Updated 7 years ago
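- Dynamic handwritten digit recognition using a model trained with deep learning (a convolutional neural network), reaching 99.64% accuracy☆12Mar 23, 2020Updated 6 years ago
- Automated big data deployment, including automated deployment of hadoop, hive, hbase, spark, storm, and other components☆71Jul 30, 2018Updated 7 years ago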
- From this paper: Density-based clustering for real-time stream data☆10Jan 7, 2017Updated 9 years ago
- Demos of the Java distributed technology stack, for personal study☆10Feb 1, 2021Updated 5 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35May 6, 2026Updated last month
- multi objective, single objective optimization, genetic algorithm for multi-objective optimization, particle swarm intelligence, ... impl…☆15May 17, 2020Updated 6 years ago
- Base hadoop/spark/bigdata image with advanced config loading scripts.☆11Nov 3, 2020Updated 5 years ago
- Code related to big data platforms (ES/Hive/Hadoop/hdfs/hbase)☆74Nov 4, 2022Updated 3 years ago
- Temporal IMLinUCB - a solution for Online Influence Maximization problem in Temporal Networks (based on IMLinUCB)☆17May 3, 2024Updated 2 years ago
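- Scrapes WeChat official account articles via a proxy, including read counts and like counts; based on anyproxy.☆13Nov 11, 2018Updated 7 years ago
- The Honeycomb (蜂巢) crawler system crawls websites and apps by defining only XPath. It supports multiple parsing methods (XPath, regular expressions), multiple download methods (the HttpClient library, PhantomJs, Selenium), and multiple output formats (Excel, MongoDB). Can be published without any modification to Yar…☆10Sep 5, 2016Updated 9 years ago
- version_cache is a distributed consistent caching solution.☆12Jul 15, 2020Updated 5 years ago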
- https://github.com/doocs/advanced-java.git☆11Jun 26, 2019Updated 6 years ago
- Analysis of Jin Yong's "Condor Trilogy" using Spark GraphX☆47Nov 5, 2020Updated 5 years ago