shivassg / Bigdata-project
Analyzing Uber Movement Dataset
☆24Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Bigdata-project
- The demo of using Kafka, Spark, Hive, Cassandra, etc by using Docker. It produces the production ready environment for any kinds of big d…☆31Updated 5 years ago
- Easy to use Multi-Provider ASR/Speech To Text and NLP engine☆28Updated 2 years ago
- A corporate Slack-like messenger built with the YugabyteDB database, Vaadin, Spring Boot, and Kong.☆9Updated last year
- Component for lazy image loading. Written in Vue js.☆8Updated 3 years ago
- Yet another Python API and CLI for Jenkins☆15Updated last year
- Apache Spark based framework for analysis A/B experiments☆11Updated last week
- Numerous - an object-oriented modelling and simulation engine.☆14Updated last year
- Training_Documents☆11Updated 9 months ago
- 基于Spark2.2新闻网大数据实时系统项目☆61Updated 5 years ago
- Welcome to this repository, dedicated to providing solutions to list of 75 of the most common problems on LeetCode!☆21Updated 11 months ago
- Lectures and hands-on on cloud computing infrastructures☆29Updated 2 months ago
- 基于spark的外卖大数据平台分析系统☆40Updated 5 years ago
- A few end to end examples that use data-describe☆16Updated last year
- ☆19Updated 5 months ago
- 基于flink的推荐系统,实时获取kafka数据进行数据清洗,离线计算进行文件读取(文件,mongodb,hbase)运用协同过滤算法进行计算得出推荐数据☆18Updated 2 years ago
- A collection of my favorite tech-related blog posts.☆9Updated this week
- Spark大型项目实战:电商用户行为分析大数据平台\Spark大型项目实战:电商用户行为分析大数据平台(史上第一套高端大数据项目实战课程)☆27Updated last year
- 使用Hadoop、Spark等实现的大数据平台项目☆20Updated 2 years ago
- The project leverages Apache Flink, Apache Kafka and Python digital Twin to provide real-time insights into healthcare data, enabling tim…☆10Updated last year
- This git repository contains different files and examples used during the advanced devops course.☆42Updated last week
- Open-source vector database built to embedding similarity search☆10Updated 3 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- 此项目是对大学生的一卡通消费数据、图书借阅记录和图书馆门禁数据在spark集群的大数据框架环境之下进行聚类、关联分析,分析出学生的消费水平、生活规律、学习强度等聚类结果,以及将聚类结果进行FPGrowth关联分析得出学生聚类之间存在的关联性,此项目是使用scala语言,利用…☆60Updated last month
- Apache NiFi 1.5/1.6/1.9.2+ Processor to produce DDL☆11Updated last year
- A Basic Flink Application Consuming & Aggregating Kafka Messages☆10Updated 5 years ago
- ODTP: A tool designed to manage, run, and design digital twins.☆11Updated this week
- Hackerank Programming Challenges☆9Updated 3 years ago
- Free programming language books☆10Updated 4 years ago