rushitjasani / Wikipedia-Search-Engine
A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
☆19Updated 5 years ago
Alternatives and similar repositories for Wikipedia-Search-Engine:
Users that are interested in Wikipedia-Search-Engine are comparing it to the libraries listed below
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆40Updated 4 years ago
- Cyber Security for Big Data and IoT using Machine Learning☆14Updated 6 years ago
- Playground for pyspark (RDDs, DStreams) and Apache Airflow. Based on the example of parsing (including incorrectly formated strings) web …☆18Updated 3 years ago
- 4 different Big Datasets joined to get single table for final data analysis. Fraud Detection by taken consideration of different key feat…☆46Updated 4 years ago
- This project's aim was to implement various Recommendation Models on Hadoop Framework and to compare their performance.☆25Updated 7 years ago
- ☆57Updated last year
- Multi-class classification model for predicting the types of crimes in Toronto☆14Updated last year
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆19Updated 7 years ago
- ☆31Updated last year
- Predict your Medical insurance cost!☆90Updated 7 months ago
- Customers in the telecom industry can choose from a variety of service providers and actively switch from one to the next. With the help …☆74Updated 3 years ago
- Data cleaning, pre-processing, and Analytics on a Health care data using Spark and Python.☆47Updated last year
- Big data projects implemented by Maniram yadav☆51Updated 6 years ago
- Crop Recommendation System Using Machine Learning☆66Updated 7 months ago
- ☆71Updated 3 weeks ago
- This repo contains Data Science code snippet☆82Updated 5 months ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆35Updated last year
- ☆60Updated last year
- A Classification Problem which predicts if a loan will get approved or not.☆39Updated 2 years ago
- Effortlessly convert YouTube lectures to concise notes with our AI transcriber. Streamline learning and comprehension with just a click!☆68Updated last year
- 611noorsaeed / Medicine-Recommendation-System-Personalized-Medical-Recommendation-System-with-Machine-Learning☆103Updated last year
- ☆12Updated 2 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆17Updated last year
- Salary Prediction Web App With Streamlit☆93Updated 9 months ago
- ☆24Updated 4 months ago
- The notebook files contains the tutorials for web scraping☆22Updated last year
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆20Updated last year
- ☆113Updated last year
- A ml model to detect emotion from text☆16Updated 9 months ago
- Heart Strokes Predictions ML Model In Production☆48Updated 2 years ago