A complete search engine built on top of a 75 GB Wikipedia corpus, with subsecond search latency. Results are wiki pages ordered by TF-IDF relevance to the given search words. From optimized code to the k-way merge sort algorithm, this project addresses latency, indexing, and big data challenges.
☆19 · Oct 16, 2019 · Updated 6 years ago
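The description above names two techniques, TF-IDF ranking and a k-way merge. A minimal Python sketch of how they could fit together follows. It is not taken from the repository: the names `merge_runs` and `rank`, the `(term, doc_id, tf)` tuple layout, and the plain `tf * log(N / df)` weighting are all assumptions made for illustration.

```python
import heapq
import math
from collections import defaultdict
from itertools import groupby
from operator import itemgetter


def merge_runs(runs):
    """K-way merge of sorted runs of (term, doc_id, tf) tuples.

    Each run is assumed to be sorted by term, e.g. one partial index
    written to disk per chunk of the corpus (hypothetical layout).
    Yields (term, postings) pairs in term order.
    """
    # heapq.merge keeps only one pending item per run in memory.
    merged = heapq.merge(*runs, key=itemgetter(0))
    for term, group in groupby(merged, key=itemgetter(0)):
        yield term, [(doc_id, tf) for _, doc_id, tf in group]


def rank(query_terms, index, num_docs):
    """Return doc ids ordered by summed tf * idf over the query terms."""
    scores = defaultdict(float)
    for term in query_terms:
        postings = index.get(term, [])
        if not postings:
            continue  # term does not occur in the corpus
        idf = math.log(num_docs / len(postings))
        for doc_id, tf in postings:
            scores[doc_id] += tf * idf
    # Highest score first; ties broken by doc id for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))


if __name__ == "__main__":
    # Two tiny in-memory runs standing in for on-disk partial indexes.
    runs = [
        [("apple", 1, 2), ("cat", 1, 1)],
        [("apple", 2, 1), ("dog", 2, 3)],
    ]
    index = dict(merge_runs(runs))
    print(rank(["apple", "dog"], index, num_docs=2))
```

For a corpus of this size, the runs would normally be file iterators rather than lists, so the merge streams from disk and memory use stays proportional to the number of runs, not the size of the index.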
Alternatives and similar repositories for Wikipedia-Search-Engine
Users that are interested in Wikipedia-Search-Engine are comparing it to the libraries listed below.
- Big Data webapp using Chicago street congestion, crashes, red light violations, and speed camera violations ☆44 · Jan 9, 2021 · Updated 5 years ago
- ☆12 · Jul 22, 2025 · Updated 9 months ago
- This is the LinkedIn Learning repository for Level Up: Python Data Acquisitions, Prep, & EDA. ☆15 · Mar 4, 2025 · Updated last year
- An end-to-end ETL pipeline that extracts weather data, transforms it, and loads it into a PostgreSQL database. ☆13 · Sep 6, 2024 · Updated last year
- Python Essentials for AWS Cloud Developers, published by Packt. ☆11 · Apr 27, 2023 · Updated 3 years ago
- Project submission for data engineering zoomcamp 2023 - https://github.com/DataTalksClub/data-engineering-zoomcamp ☆10 · Apr 27, 2023 · Updated 3 years ago
- Speech Emotion Recognition Project ☆11 · Dec 15, 2019 · Updated 6 years ago
- Hackathon project ☆12 · Mar 20, 2022 · Updated 4 years ago
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Express ☆12 · Oct 16, 2024 · Updated last year
- End-to-end data engineering pipeline with various technologies to ingest real time data. ☆26 · Nov 3, 2023 · Updated 2 years ago
- This is a guided certification project, as a part of Data Science for Social Good initiative ☆18 · Mar 9, 2020 · Updated 6 years ago
- 📚🧪 Traffic Sentinel is a learning-focused POC that explores a scalable IoT architecture using Fog nodes and Apache Flink to process 📷 … ☆28 · Dec 29, 2025 · Updated 4 months ago
- Dapplo.CaliburnMicro is a Caliburn bootstrapper (and more) to quickly start with a WPF MVVM Application ☆21 · May 14, 2021 · Updated 4 years ago
- A revolutionary AI-powered platform to help you solve doubts instantly, make learning easy, and achieve academic success. ☆16 · Nov 1, 2024 · Updated last year
- This repo is for the LinkedIn Learning course: Testing Python Data Science Code ☆21 · Sep 26, 2025 · Updated 7 months ago
- Case Studies and Projects in Machine Learning/EDA/DL ☆24 · Jun 18, 2024 · Updated last year
- weighted category-balanced dataset builder for LLM fine-tuning ☆16 · Feb 21, 2026 · Updated 2 months ago
- Mifos Fineract Client is a Java based library that provides a simple interface to interact with the Apache Fineract 1.x Platform APIs ☆16 · Mar 26, 2025 · Updated last year
- my zsh configuration ☆13 · Jun 26, 2025 · Updated 10 months ago
- VBA code of worksheet functions for linear and bilinear interpolation based on interp1 and interp2 in MATLAB ☆28 · Aug 24, 2021 · Updated 4 years ago
- ☆17 · Feb 11, 2022 · Updated 4 years ago
- ☆30 · Jan 17, 2023 · Updated 3 years ago
- A simple Dash and Plotly dashboard to review and compare federal economic data ☆13 · Feb 1, 2022 · Updated 4 years ago
- Implementation of a system capable of encryption and decryption of multimedia data (Text, Images, Videos, Audio etc.) using a hybrid mode… ☆22 · Feb 7, 2024 · Updated 2 years ago
- MCP server for managing and serving analysis prompt templates ☆23 · Dec 13, 2024 · Updated last year
- ☆15 · Jul 31, 2022 · Updated 3 years ago
- Simple way to send ether. ☆24 · Nov 23, 2020 · Updated 5 years ago
- Sign language translation model for the app Look & Tell https://github.com/khooinguyeen/LookandTell-OfficialApp ☆26 · Apr 16, 2023 · Updated 3 years ago
- This dataset contains information on hotel bookings. We have performed exploratory data analysis in Python to get insights from the data. ☆13 · Apr 12, 2020 · Updated 6 years ago
- ☆25 · Apr 23, 2022 · Updated 4 years ago
- Disease Prediction based on Symptoms. ☆342 · Feb 15, 2023 · Updated 3 years ago
- Complete PySpark Guide for beginners... I prepared this notebook for my students. ☆19 · Sep 18, 2019 · Updated 6 years ago
- MLgenerator is a web app which helps you generate machine learning starter code with ease. ☆34 · Feb 5, 2021 · Updated 5 years ago
- Snowflake Data Engineering in Action ☆39 · Oct 18, 2024 · Updated last year
- ☆24 · May 21, 2024 · Updated last year
- This is a Messenger App, made with React, styled with the help of Material UI, and deployed with the help of Firebase. 💭🖥️ ☆19 · Apr 10, 2022 · Updated 4 years ago
- Streaming Anomaly Detection Solution by using Pub/Sub, Dataflow, BQML & Cloud DLP ☆192 · Jan 5, 2026 · Updated 4 months ago
- ☆33 · Mar 2, 2026 · Updated 2 months ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar… ☆51 · Dec 4, 2023 · Updated 2 years ago