alea-institute / nupunktLinks
Next-generation Punkt sentence boundary detection with zero dependencies
☆17Updated last month
Alternatives and similar repositories for nupunkt
Users that are interested in nupunkt are comparing it to the libraries listed below
Sorting:
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 9 months ago
- Small python package to measure OCR quality and other related metrics.☆22Updated last year
- A simple library for segmenting legal texts☆17Updated 2 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 5 years ago
- Named entity recognition for the legal domain☆42Updated 3 years ago
- API client for fetching and comparing passages from legislation☆11Updated 4 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- CLI that queries multiple language models in parallel using prompts from a CSV file☆26Updated last week
- spaCy entry points for Curated Transformers☆31Updated this week
- 🌸 Train floret vectors☆18Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆18Updated 9 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- Library for fast text representation and classification.☆28Updated last year
- ☆18Updated 3 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- ☆55Updated last year
- ☆22Updated 4 months ago
- LLM plugin for embeddings using sentence-transformers☆62Updated last month
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- ☆30Updated 2 years ago
- 🔢 Work with static vector models☆28Updated last month
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆31Updated last month
- Efficient few-shot learning with cross-encoders.☆52Updated last year
- KL3M training data collection and preprocessing☆11Updated last month
- Python package for extractive NLP using the OpenAI API☆17Updated 9 months ago
- Plug-and-play document processing pipelines with zero-shot models.☆63Updated 3 weeks ago
- Code for SaGe subword tokenizer (EACL 2023)☆25Updated 6 months ago
- Adding Marimo to Datasette☆20Updated 2 months ago
- ☆17Updated 2 years ago
- Python library to use Pleias-RAG models☆51Updated last month