rushitjasani / Wikipedia-Search-Engine
A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
☆18Updated 5 years ago
Alternatives and similar repositories for Wikipedia-Search-Engine:
Users that are interested in Wikipedia-Search-Engine are comparing it to the libraries listed below
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations☆41Updated 4 years ago
- This project's aim was to implement various Recommendation Models on Hadoop Framework and to compare their performance.☆25Updated 7 years ago
- Playground for pyspark (RDDs, DStreams) and Apache Airflow. Based on the example of parsing (including incorrectly formated strings) web …☆16Updated 3 years ago
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆19Updated 7 years ago
- ☆56Updated last year
- Big data projects implemented by Maniram yadav☆52Updated 6 years ago
- Data cleaning, pre-processing, and Analytics on a Health care data using Spark and Python.☆47Updated last year
- Machine Learning Projects☆14Updated 3 years ago
- Case Studies and Projects in Machine Learning/EDA/DL☆19Updated 8 months ago
- Cyber Security for Big Data and IoT using Machine Learning☆14Updated 6 years ago
- A data science project to predict whether a transaction is a fraud or not.☆162Updated 2 weeks ago
- 4 different Big Datasets joined to get single table for final data analysis. Fraud Detection by taken consideration of different key feat…☆46Updated 4 years ago
- Customers in the telecom industry can choose from a variety of service providers and actively switch from one to the next. With the help …☆71Updated 3 years ago
- This complete project is made as a part of Data Science Internship at iNeuron.ai Refer to the README for more detailed explanation about …☆19Updated 5 months ago
- A Machine Learning Model to analyse the nature of tweets and classify them as Positive/Negative☆14Updated 4 years ago
- ☆13Updated 2 years ago
- The objective of this assignment is to extract textual data articles from the given URL and perform text analysis to compute variables th…☆28Updated 2 years ago
- A streamlit app to analyze your whatsapp chats☆84Updated 10 months ago
- Crop Recommendation System Using Machine Learning☆58Updated 5 months ago
- End to end projects-- Customer Churning prediction using Gradient Boost Classifier Algorithm perform pre-processing steps then fit data i…☆19Updated last year
- A content based movie recommender system using cosine similarity☆164Updated 7 months ago
- An end-to-end machine learning project, student performance indicator. The goal of this project is to understand the influence of the par…☆11Updated last year
- Tutorial LInk:-☆36Updated last year
- Hi Everyone Glad to see your interest in this repo and welcome, we will be working on end to end data science project which is "Loan Pred…☆40Updated 2 years ago
- ☆29Updated last year
- A ml model to detect emotion from text☆15Updated 8 months ago
- ☆11Updated 5 years ago
- Data Science Capstone Project Using Python and Tableau 10☆50Updated 2 years ago
- In this repo, I upload all-time series forecasting projects☆14Updated 3 years ago
- This project is on the datasets manually created by me over a period of so many weeks which covers 1M records generated on a random basis…☆25Updated last year