ml-for-nlp / build-your-search-engine
A repository to learn basic data processing techniques (Wikipedia processing, feature selection), and use them for some basic Web query classification.
☆24 Updated 2 years ago
Alternatives and similar repositories for build-your-search-engine:
Users who are interested in build-your-search-engine are comparing it to the repositories listed below.
- Examples of vector DB indexing and query with various vector databases. ☆12 Updated last week
- Generate a SQLite database from Wikipedia & Wikidata dumps. ☆33 Updated 10 months ago
- Graph databases, Knowledge Graphs, SPARQL ☆76 Updated 3 years ago
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache … ☆26 Updated 2 years ago
- 🚀 GUI for training spaCy models ☆54 Updated 3 years ago
- Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for impr… ☆52 Updated last year
- ln2sql as a python package ☆17 Updated 5 years ago
- Natural Language Processing ☆28 Updated 11 months ago
- Generating flashcards from lecture notes ☆20 Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆71 Updated 2 years ago
- Semantic Parser Localizer (SPL) code repository ☆9 Updated 3 years ago
- ☆42 Updated last week
- Machine Learning for Information Retrieval ☆86 Updated last week
- Awesome Orchest projects, both official and submitted by the community. ☆25 Updated last year
- Do everything from data collection from reddit to training a machine learning model in just two lines of python code! ☆82 Updated last year
- Named entity recognition for the legal domain ☆41 Updated 3 years ago
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using … ☆17 Updated 3 years ago
- NLP engine code ☆13 Updated this week
- A Directory of Online Newspaper Sources for 70+ Languages ☆33 Updated 3 years ago
- Natural Language Generation for Gramex applications. ☆24 Updated 2 years ago
- Information extraction from English and German texts based on predicate logic ☆135 Updated last year
- CorrectLy - Open Source Spelling & Grammar correction ☆40 Updated 2 years ago
- Ask Me: Question Generating Agent ☆13 Updated 6 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆37 Updated 5 years ago
- Language detection using Spacy and Fasttext ☆55 Updated last year
- Language Models for Zalando's flair library ☆61 Updated 5 years ago
- Summarize. is a Streamlit application that performs automatic text summarization using both extractive and abstractive models. ☆16 Updated 3 years ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆25 Updated last year
- Matrix-based News Aggregation to Explore Media Bias ☆20 Updated 6 years ago
- Build Semantic Search with S-BERT and Fine-tune your model in unsupervised way ☆58 Updated 2 years ago