rushitjasani / Wikipedia-Search-Engine
A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
☆18Updated 5 years ago
Alternatives and similar repositories for Wikipedia-Search-Engine:
Users that are interested in Wikipedia-Search-Engine are comparing it to the libraries listed below
- This project's aim was to implement various Recommendation Models on Hadoop Framework and to compare their performance.☆25Updated 7 years ago
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆41Updated 4 years ago
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆19Updated 6 years ago
- Cyber Security for Big Data and IoT using Machine Learning☆14Updated 6 years ago
- Playground for pyspark (RDDs, DStreams) and Apache Airflow. Based on the example of parsing (including incorrectly formated strings) web …☆16Updated 2 years ago
- Crop Recommendation System Using Machine Learning☆52Updated 4 months ago
- Predict your Medical insurance cost!☆82Updated 4 months ago
- Multi-class classification model for predicting the types of crimes in Toronto☆14Updated 10 months ago
- ☆54Updated last year
- ☆45Updated 2 years ago
- 4 different Big Datasets joined to get single table for final data analysis. Fraud Detection by taken consideration of different key feat…☆46Updated 4 years ago
- A ml model to detect emotion from text☆14Updated 7 months ago
- Case Studies and Projects in Machine Learning/EDA/DL☆18Updated 7 months ago
- This repository contain Data Analysis on Black Friday Sales Data using various Regression ML algorithms☆14Updated 4 years ago
- Big data projects implemented by Maniram yadav☆51Updated 6 years ago
- Effortlessly convert YouTube lectures to concise notes with our AI transcriber. Streamline learning and comprehension with just a click!☆66Updated 11 months ago
- Data visualisations in Power BI☆20Updated 3 years ago
- ☆101Updated 3 years ago
- Sales insights project using Powerbi and SQL☆27Updated last year
- ☆38Updated 7 months ago
- As a part of my internship with iNeuron.ai, I worked independently on an end-to-end project "Heart Disease Diagnostic Analysis".☆45Updated 3 years ago
- Heart Strokes Predictions ML Model In Production☆44Updated 2 years ago
- ☆13Updated 2 years ago
- ☆99Updated last year
- Advanced SQL - Discover sequential, step-by-step explanations and solutions, accompanied by the necessary database creation codes, availa…☆22Updated last year
- Medical data extraction from medical documents like prescription and patient details document using python and Regex☆19Updated 2 years ago
- ☆22Updated 3 months ago
- This repo contains Data Science code snippet☆83Updated 3 months ago
- ☆22Updated 10 months ago
- Data science virtual internship program by British Airways through Forage!☆35Updated 2 years ago