A really fast document ranking engine using BM25 and TF-IDF. Built in Python with the NLP packages NLTK and spaCy.
☆17May 8, 2018Updated 8 years ago
Alternatives and similar repositories for document-search-engine
Users that are interested in document-search-engine are comparing it to the libraries listed below.
- Used Python, NLTK, NLP techniques to make a search engine that ranks documents based on search keyword, based on TF-IDF weights and cosin…☆17Jul 11, 2017Updated 8 years ago
- Example for Logging LLM Evaluator Prompt Responses☆18Aug 14, 2023Updated 2 years ago
- ☆12Feb 11, 2023Updated 3 years ago
- The official gpt4free repository | various collection of powerful language models☆10Mar 22, 2024Updated 2 years ago
- Document Search Engine project with TF-IDF and Google Universal Sentence Encoder model☆55May 1, 2023Updated 3 years ago
- Testing speed and cost of classification via LLM or via vector embeddings☆21Aug 6, 2023Updated 2 years ago
- Sample datasets of over 400 Instagram coding influencers☆14Feb 20, 2025Updated last year
- Telegram archive viewer☆15Apr 17, 2022Updated 4 years ago
- Logistic regression, text emotion classifier web application (with Streamlit), from data preprocessing to model productionizing and deplo…☆15Oct 7, 2025Updated 8 months ago
- Detect user's sex by name☆17Mar 12, 2019Updated 7 years ago
- Multilingual emotion analysis research☆21Apr 8, 2024Updated 2 years ago
- Writing Primer for Data Scientists☆18Feb 19, 2020Updated 6 years ago
- ☆18Dec 25, 2021Updated 4 years ago
- GPT-3 Chatbot with long-term memory and external sources. Original work & inspiration by @daveshap☆17Jan 29, 2023Updated 3 years ago
- Web UI with search and navigation for Telegram chat / channel dump (export) from JSON☆18May 23, 2022Updated 4 years ago
- This repository will teach you how to Use ChatGPT API with Python in Just 5 Minutes☆15Mar 2, 2023Updated 3 years ago
- Document Search Engine Tool☆79Dec 8, 2022Updated 3 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- An experimental open-source attempt to make GPT-4 fully autonomous.☆17Apr 3, 2023Updated 3 years ago
- A GPT client with long term memory☆40May 26, 2023Updated 3 years ago
- ☆23Apr 28, 2026Updated last month
- Scrapy best practices☆38Sep 30, 2020Updated 5 years ago
- ☆12May 17, 2022Updated 4 years ago
- Swizec as a bot with chatbot-ui☆15Jun 10, 2023Updated 3 years ago
- Poems retrieval demo built with GNES framework☆14Oct 3, 2019Updated 6 years ago
- [DEPRECATED - Please use https://github.com/pylons/pyramid-cookiecutter-starter instead] A Cookiecutter (project template) for creating a…☆41Oct 31, 2018Updated 7 years ago
- SIGIR'20: An Analysis of BERT in Document Ranking☆21Jul 27, 2020Updated 5 years ago
- 🔬 Sharing your data science notebooks with the community has never been this easy.☆44Oct 20, 2022Updated 3 years ago
- A modern, opinionated Cookiecutter template for building async Python web APIs powered by Aiohttp, SQLAlchemy 2.0, and PostgreSQL.☆52Apr 1, 2026Updated 2 months ago
- Scrape Medium articles using tags.☆43Jun 13, 2019Updated 6 years ago
- A cookiecutter template to help you make new JupyterLab theme extensions☆50Jan 25, 2022Updated 4 years ago
- A batteries-included, opinionated template for Django Rest Framework APIs☆41Dec 8, 2022Updated 3 years ago
- WSDM2021 Tutorial: Beyond Probability Ranking Principle: Modeling the Dependencies among Documents☆23Mar 12, 2021Updated 5 years ago
- A dataset of semantically related sentence pairs in the German legal domain☆10Feb 26, 2021Updated 5 years ago
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆12Apr 8, 2022Updated 4 years ago
- Question and Answer for CSV using langchain and OpenAI☆57May 29, 2023Updated 3 years ago
- ☆40Feb 8, 2022Updated 4 years ago
- Implementation of "A Neural Probabilistic Language Model" by Yoshua Bengio et al. - Tensorflow☆11Feb 2, 2023Updated 3 years ago
- Spark is an Auto-GPT alternative that uses LocalAI.☆89Oct 6, 2023Updated 2 years ago