ScJa / document-search-engineLinks
A really fast document ranking engine using BM25 and TF-IDF. Based on Python using NLP packages NLTK and spacY.
☆16Updated 7 years ago
Alternatives and similar repositories for document-search-engine
Users that are interested in document-search-engine are comparing it to the libraries listed below
Sorting:
- Document Search Engine project with TF-IDF abd Google universal sentence encoder model☆54Updated 2 years ago
- Document Search Engine Tool☆74Updated 2 years ago
- An app that extracts your twitter threads into a downloadable CSV file.☆11Updated 2 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆77Updated last year
- Semantic Search Engine using BERT embeddings☆33Updated 5 years ago
- Example for Logging LLM Evaluator Prompt Responses☆18Updated 2 years ago
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…☆16Updated 3 years ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆17Updated 2 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆64Updated 9 months ago
- A dataset for pretraining language models targeted for legal tasks.☆138Updated 3 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆25Updated 3 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal☆98Updated 2 years ago
- CaseText Court Case analysis with fine-tuned BERT Transformer☆14Updated 5 years ago
- 🖍️ Highlight text in documents☆109Updated 5 months ago
- Simple pdf to text with python using PDFtk and PyPDF2☆21Updated 2 years ago
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI☆113Updated 2 years ago
- This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much.☆30Updated 2 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆95Updated 2 years ago
- Solve Geometric & Graph Problems with Large Language Models☆33Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆70Updated 2 years ago
- Implementation of different summarization algorithms applied to legal case judgements.☆212Updated 2 years ago
- clustering news, extract trending news stories☆12Updated 4 years ago
- The official gpt4free repository | various collection of powerful language models☆10Updated last year
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, …☆86Updated 10 months ago
- Model for predicting categories of entities by its mentions☆29Updated 4 years ago
- Search PDFs using Jina, DocArray and Jina Hub☆56Updated 3 years ago
- Daily TV News Summary using GPT☆23Updated 5 months ago
- ☆63Updated last year
- Various Jupyter notebooks about Common Crawl data☆59Updated 6 months ago
- Keyword Extraction and Analysis Pipeline & Application with KeyBERT and Taipy☆17Updated 2 years ago