ziky90 / tf-idf-Hadoop-MapReduce
Project from the CTU Big Data course which purpose was to compute tf-idf values for the czech wikipedia
☆10Updated 10 years ago
Alternatives and similar repositories for tf-idf-Hadoop-MapReduce:
Users that are interested in tf-idf-Hadoop-MapReduce are comparing it to the libraries listed below
- Code examples on Apache Spark using python☆106Updated 2 years ago
- GitHub repository with resources about the course "Software Dependability" taught at Unisa in 2019.☆7Updated 5 years ago
- DevOps Tools, Instructions.. etc☆13Updated last year
- Spark Notebook docker image☆10Updated 7 years ago
- Dialogflow agent fulfillment library supporting Dialogflow v2 API☆37Updated last year
- My Git Repo for Csv Data☆20Updated 4 years ago
- Twitter real-time sentiment analysis using Spark Structured Streaming and Python☆18Updated 4 years ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- Coursu is an course recommendation system that aims to ask few questions to a user and based on his/her needs our application recommends…☆20Updated 4 years ago
- CCE-AI Class codes☆8Updated 7 years ago
- Repository Ufficiale del Corso Programmazione con Python per Machine Learning e Artificial Intelligence☆10Updated 5 years ago
- This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, component…☆27Updated 5 months ago
- Docker build for Apache Spark☆673Updated 3 years ago
- A simple spark standalone cluster for your testing environment purposses☆567Updated 11 months ago
- ☆157Updated 2 years ago
- Basic Spark examples.☆10Updated 4 years ago
- A sequence2sequence chatbot implementation with TensorFlow.☆100Updated 5 years ago
- collection of iPython notebooks☆405Updated last year
- Twitter Sentiment Analysis using Spark and Kafka☆114Updated 5 years ago
- (Under Construction) I am currently writing a solution from the Medium article "Cracking the Machine Learning Interview," written by Subh…☆87Updated 5 years ago
- The demo of using Kafka, Spark, Hive, Cassandra, etc by using Docker. It produces the production ready environment for any kinds of big d…☆32Updated 5 years ago
- A set of coding challenge for various engineering roles at Isentia☆20Updated 3 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆39Updated 3 years ago
- Repository for a data science starter app using Flask, Angular and Docker. https://medium.com/@dvelsner/deploying-a-simple-machine-learni…☆87Updated 6 years ago
- A Spark cluster setup running on Docker containers☆60Updated 5 years ago
- FLY a Domain Specific Language for scientific computing on the Multi Cloud☆12Updated last year
- Creating a Machine Learning API using Flask - Repository for AV Article☆191Updated 5 years ago
- ☆32Updated 6 years ago
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆41Updated 5 years ago
- A Project where one can fetch and read tweets and show the analysis like who is most influential☆28Updated last year