JustlyAI / lmss_entity_extractor
Tool to apply Legal Matter Specification Standard (LMSS) to documents
β12Updated 7 months ago
Alternatives and similar repositories for lmss_entity_extractor:
Users that are interested in lmss_entity_extractor are comparing it to the libraries listed below
- Small python package to measure OCR quality and other related metrics.β21Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β47Updated 6 months ago
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Updated 11 months ago
- Knowledge Graph Generator appβ30Updated 11 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progrβ¦β28Updated 2 months ago
- A simple library for segmenting legal textsβ15Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.β25Updated last year
- ChatBot App built using LangChain and Lightning AIβ18Updated 2 years ago
- Tools to make language models a bit easier to useβ39Updated last week
- Using modal.com to process FineWeb-edu dataβ20Updated 2 weeks ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasetsβ17Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 4 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Modelsβ21Updated 3 months ago
- Using short models to classify long textsβ21Updated 2 years ago
- Code interpreter support for o1β32Updated 6 months ago
- API client for fetching and comparing passages from legislationβ11Updated last month
- Writing Blog Posts with Generative Feedback Loops!β47Updated last year
- LLM plugin for embeddings using sentence-transformersβ52Updated last month
- a pipeline for using api calls to agnostically convert unstructured data into structured training dataβ29Updated 5 months ago
- β30Updated 8 months ago
- β22Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpusβ14Updated 4 years ago
- utilities for loading and running text embeddings with onnxβ44Updated 7 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ58Updated last year
- Embedding models from Jina AIβ58Updated last year
- Efficient few-shot learning with cross-encoders.β49Updated last year
- β48Updated 4 months ago
- examples and guides to using Nomic Atlasβ27Updated 3 weeks ago