rushitjasani / Wikipedia-Search-Engine

A complete search engine built on a 75 GB Wikipedia corpus, with sub-second search latency. Results are Wikipedia pages ranked by TF-IDF relevance to the given search terms. From optimized code to a K-way merge sort algorithm, the project addresses latency, indexing, and big-data challenges.
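
As a rough illustration of the two techniques named in the description, the sketch below merges sorted partial index runs with a K-way merge and then ranks documents by TF-IDF. It is not taken from the repository: the function names `merge_partial_indexes` and `search`, the `(term, doc_id, tf)` tuple layout, and the in-memory lists standing in for on-disk index files are all assumptions made for the example.

```python
import heapq
import math
from itertools import groupby
from operator import itemgetter


def merge_partial_indexes(partials):
    """K-way merge of sorted (term, doc_id, tf) runs into one postings map.

    Each run must already be sorted; heapq.merge then yields the combined
    stream lazily, keeping only one pending item per run in its heap.
    """
    merged = heapq.merge(*partials)
    index = {}
    for term, group in groupby(merged, key=itemgetter(0)):
        index[term] = [(doc_id, tf) for _, doc_id, tf in group]
    return index


def search(index, total_docs, query):
    """Return doc ids ordered by summed TF-IDF score, highest first."""
    scores = {}
    for term in query.lower().split():
        postings = index.get(term, [])
        if not postings:
            continue  # term does not occur in the corpus
        idf = math.log(total_docs / len(postings))
        for doc_id, tf in postings:
            scores[doc_id] = scores.get(doc_id, 0.0) + tf * idf
    # Break score ties by doc id so the ordering is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical partial runs, as if produced by indexing two corpus chunks.
run_a = [("engine", 1, 2), ("search", 1, 3), ("wiki", 2, 1)]
run_b = [("engine", 3, 1), ("search", 2, 1)]

index = merge_partial_indexes([run_a, run_b])
print(search(index, total_docs=4, query="search engine"))
```

On a corpus of this size the runs would be files on disk read line by line rather than lists, but the same lazy merge applies because `heapq.merge` accepts any sorted iterables.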
17 · Updated 5 years ago

Related projects

Alternatives and complementary repositories for Wikipedia-Search-Engine