Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
☆95May 19, 2021Updated 4 years ago
Alternatives and similar repositories for Movies-Analytics-in-Spark-and-Scala
Users that are interested in Movies-Analytics-in-Spark-and-Scala are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Big data projects implemented by Maniram yadav☆50May 5, 2018Updated 7 years ago
- All my projects on Big Data are provided☆27Dec 5, 2016Updated 9 years ago
- Predicting hit songs on Spotify by classifying 40,000 songs using Machine Learning☆10Jun 29, 2022Updated 3 years ago
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆44Jan 9, 2021Updated 5 years ago
- A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki page…☆19Oct 16, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Rint Network is free and open-source project for enabling anonymous, encryption & communication by directing Internet traffic through a w…☆23Aug 31, 2021Updated 4 years ago
- Template for Scala Spark with Unit Test☆13Jul 24, 2023Updated 2 years ago
- SIEM, Visibility, and Event-Driven Architecture Curated Solutions. Build a cost-effective threat detection and log management system.☆18Jan 17, 2024Updated 2 years ago
- A shell script to automate the operations of sqoop☆11Mar 29, 2021Updated 5 years ago
- Docker configuration to build linux chromium☆16Jun 8, 2022Updated 3 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- Web Scraping monster.com using scrapy with JSON APIs☆10Oct 18, 2019Updated 6 years ago
- Data Structures and Algos☆13Apr 8, 2023Updated 3 years ago
- Handy Reusable Utilities☆22Nov 13, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Python Essentials for AWS Cloud Developers, published by Packt.☆11Apr 27, 2023Updated 2 years ago
- Using Natural Language Processing to standardize Company Names☆11Aug 4, 2021Updated 4 years ago
- 🟣 Classification Algorithms interview questions and answers to help you prepare for your next machine learning and data science intervie…☆11Jan 4, 2026Updated 3 months ago
- 🟣 Explainable Ai interview questions and answers to help you prepare for your next machine learning and data science interview in 2026.☆12Jan 4, 2026Updated 3 months ago
- A custom AWS credential provider that allows your Hadoop or Spark application access S3 file system by assuming a role☆10Jan 9, 2026Updated 3 months ago
- 🟣 Ml Design Patterns interview questions and answers to help you prepare for your next machine learning and data science interview in 20…☆17Jan 4, 2026Updated 3 months ago
- 🟣 Neural Networks interview questions and answers to help you prepare for your next machine learning and data science interview in 2026.☆14Jan 4, 2026Updated 3 months ago
- 🟣 Ensemble Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 202…☆11Jan 4, 2026Updated 3 months ago
- 🟣 Feature Engineering interview questions and answers to help you prepare for your next machine learning and data science interview in 2…☆16Jan 4, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP!☆12Jul 6, 2023Updated 2 years ago
- A big data project to develop a real-time data pipeline for analyzing the popularity and sentiments of trending topics on Twitter.☆24Jun 21, 2022Updated 3 years ago
- Efficient Global Optimization☆10Feb 26, 2016Updated 10 years ago
- capstone project for Dataengineer.io bootcamp Public Repo☆12Feb 20, 2024Updated 2 years ago
- 🟣 RNN interview questions and answers to help you prepare for your next machine learning and data science interview in 2026.☆15Jan 4, 2026Updated 3 months ago
- ☆12Nov 11, 2019Updated 6 years ago
- 🟣 K Nearest Neighbors interview questions and answers to help you prepare for your next machine learning and data science interview in 2…☆12Jan 4, 2026Updated 3 months ago
- Vietnam stock price crawling☆21Dec 8, 2022Updated 3 years ago
- 🟣 Naive Bayes interview questions and answers to help you prepare for your next machine learning and data science interview in 2026.☆14Jan 4, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- My personal space on internet☆10Apr 25, 2021Updated 4 years ago
- tests for javascript developers☆14Mar 1, 2023Updated 3 years ago
- Playground for pyspark (RDDs, DStreams) and Apache Airflow. Based on the example of parsing (including incorrectly formated strings) web …☆18Feb 21, 2022Updated 4 years ago
- The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.☆14Jul 22, 2021Updated 4 years ago
- Learn Power BI, second edition, published by Packt.☆41Mar 2, 2026Updated last month
- The solutions of all SQL hackerrank challenges using MySQL environment☆809Apr 30, 2024Updated last year
- 🟣 Model Evaluation interview questions and answers to help you prepare for your next machine learning and data science interview in 2026…☆17Jan 4, 2026Updated 3 months ago