ScJa / document-search-engine
A really fast document ranking engine using BM25 and TF-IDF. Based on Python using NLP packages NLTK and spacY.
☆16Updated 6 years ago
Alternatives and similar repositories for document-search-engine:
Users that are interested in document-search-engine are comparing it to the libraries listed below
- Document Search Engine project with TF-IDF abd Google universal sentence encoder model☆53Updated last year
- Neural Elastic Inference and Search☆19Updated 5 years ago
- An app that extracts your twitter threads into a downloadable CSV file.☆10Updated last year
- Sample datasets of over 400 Instagram coding influencers☆11Updated last month
- simple rule based named entity recognition☆43Updated 3 years ago
- Document Search Engine Tool☆72Updated 2 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆96Updated last year
- Use Kore.ai's Knowledge Graph Generator to automatically extract terms from FAQs, define the hierarchy between these terms, and also asso…☆15Updated last year
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, …☆80Updated 4 months ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- Perform Latent Dirichlet Allocation on scientific articles with Gensim☆15Updated 5 years ago
- sequence tagging with spaCy and crfsuite☆19Updated 2 years ago
- Model for predicting categories of entities by its mentions☆29Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- AI models for automatic job application pipeline (user CV, job description analysis (customized NER/SpaCy) and artificial cover letter ge…☆35Updated 10 months ago
- ☆64Updated last year
- a repo for the cord19 challenge☆32Updated last year
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…☆14Updated 3 years ago
- The Selenium scraper that collected a million stories from Medium.com☆80Updated 6 years ago
- This is a tutorial I made on how to deploy a HuggingFace/LangChain pipeline on the newly released Falcon 7B LLM by TII☆10Updated last year
- An implementation of bidirectional LSTM-CRF for Named Entity Relationship on custom corpus with custom word embeddings☆13Updated 5 years ago
- Named entity recognition for the legal domain☆43Updated 3 years ago
- ☆70Updated 4 years ago
- GPT-3 Chatbot with long-term memory and external sources. Original work & inspiration by @daveshap☆17Updated 2 years ago
- Python SDK for the TextRazor Text Analytics API☆20Updated last year
- ☆14Updated 3 years ago
- Examples of RAG using LangChain with local LLMs - Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆37Updated last year
- 📃 A contracts clause summarization system using LLM and vector database☆16Updated last month
- No Teacher BART distillation experiment for NLI tasks☆27Updated 4 years ago
- Building Chatbots with Rasa,Spacy,Wit.Ai,etc☆30Updated 6 years ago