ScJa / document-search-engineLinks
A really fast document ranking engine using BM25 and TF-IDF. Based on Python using NLP packages NLTK and spacY.
☆16Updated 7 years ago
Alternatives and similar repositories for document-search-engine
Users that are interested in document-search-engine are comparing it to the libraries listed below
Sorting:
- Document Search Engine project with TF-IDF abd Google universal sentence encoder model☆54Updated 2 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆78Updated last year
- An app that extracts your twitter threads into a downloadable CSV file.☆11Updated 2 years ago
- Document Search Engine Tool☆76Updated 3 years ago
- Sample datasets of over 400 Instagram coding influencers☆13Updated 9 months ago
- Semantic Search Engine using BERT embeddings☆33Updated 5 years ago
- Summarize text content into a Tweet-sized statement using OpenAI's GPT-3 based Davinci model☆23Updated last year
- Used Python, NLTK, NLP techniques to make a search engine that ranks documents based on search keyword, based on TF-IDF weights and cosin…☆17Updated 8 years ago
- Testing speed and cost of classification via LLM or via vector embeddings☆21Updated 2 years ago
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Updated 3 years ago
- NLP-based Contract Analysis☆12Updated 8 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal☆99Updated 2 years ago
- Lobe is the world's first AI paralegal.☆51Updated 3 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 8 months ago
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI☆112Updated 2 years ago
- The official gpt4free repository | various collection of powerful language models☆10Updated last year
- CaseText Court Case analysis with fine-tuned BERT Transformer☆14Updated 5 years ago
- A dataset for pretraining language models targeted for legal tasks.☆140Updated 3 years ago
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…☆16Updated 3 years ago
- ☆63Updated last year
- Example for Logging LLM Evaluator Prompt Responses☆18Updated 2 years ago
- Generate True or False questions from any content with OpenAI GPT2 text generation, Sentence-BERT semantic search and Berkley constituenc…☆34Updated 5 years ago
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js☆27Updated 2 years ago
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆13Updated 4 years ago
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, …☆88Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆26Updated 2 years ago
- clustering news, extract trending news stories☆12Updated 4 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆65Updated 10 months ago
- ☆63Updated 2 years ago
- 🖍️ Highlight text in documents☆109Updated 7 months ago