alea-institute / nupunktLinks
Next-generation Punkt sentence boundary detection with zero dependencies
☆17Updated last month
Alternatives and similar repositories for nupunkt
Users that are interested in nupunkt are comparing it to the libraries listed below
Sorting:
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- 🌸 Train floret vectors☆18Updated 2 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- ☆55Updated last year
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- A simple library for segmenting legal texts☆17Updated 2 years ago
- scraping and querying documents for LLMs☆23Updated last month
- 🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library☆100Updated 2 months ago
- Language detection using Spacy and Fasttext☆57Updated last year
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- Enhaced version of Wikiextrator: A wikipedia dumps extractor☆19Updated 2 months ago
- Named entity recognition for the legal domain☆42Updated 4 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 6 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- ☆76Updated 8 months ago
- Generate reports for spaCy models.☆29Updated 3 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆69Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆164Updated 2 years ago
- spaCy entry points for Curated Transformers☆32Updated 3 months ago
- A python package to simulate typographical errors.☆37Updated last year
- The NLP Bias Identification Toolkit☆37Updated 2 years ago
- ☆28Updated last year
- API client for fetching and comparing passages from legislation☆14Updated 7 months ago
- Plug-and-play document processing pipelines with zero-shot models.☆97Updated 3 weeks ago
- PyLate efficient inference engine☆64Updated last month
- 🦦 weasel: A small and easy workflow system☆85Updated last year
- ☆30Updated 3 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆18Updated last year