JustlyAI / lmss_entity_extractor
Tool to apply Legal Matter Specification Standard (LMSS) to documents
☆13Updated 8 months ago
Alternatives and similar repositories for lmss_entity_extractor:
Users that are interested in lmss_entity_extractor are comparing it to the libraries listed below
- Next-generation Punkt sentence boundary detection with zero dependencies☆16Updated last month
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆30Updated 3 weeks ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.☆21Updated last year
- A simple library for segmenting legal texts☆15Updated 2 years ago
- ☆43Updated 2 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 8 months ago
- Analysis on the cost of encoder based models☆11Updated 2 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆46Updated 3 weeks ago
- Efficient few-shot learning with cross-encoders.☆51Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆25Updated last year
- Python library to use Pleias-RAG models☆36Updated this week
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- NLP with Rust for Python 🦀🐍☆62Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 6 months ago
- utilities for loading and running text embeddings with onnx☆44Updated 8 months ago
- ☆20Updated last year
- Lightweight tools for quick and easy LLM demo's☆26Updated 7 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- ☆19Updated 6 months ago
- ☆14Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated last month
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 7 months ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆17Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆14Updated 4 years ago
- 🤝 Trade any tensors over the network☆30Updated last year
- Synthetic text dataset generation☆9Updated this week
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated 11 months ago