alea-institute / nupunkt
Next-generation Punkt sentence boundary detection with zero dependencies
☆27 Updated last month
Alternatives and similar repositories for nupunkt
Users who are interested in nupunkt are comparing it to the libraries listed below.
- Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- Small Python package to measure OCR quality and other related metrics. ☆25 Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx-like syntax. ☆59 Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 Updated last year
- Named entity recognition for the legal domain ☆42 Updated 4 years ago
- ☆55 Updated 2 years ago
- Train floret vectors ☆18 Updated 2 years ago
- Language detection using spaCy and fastText ☆57 Updated 2 years ago
- LegalCrawler: A tool for automated scraping of English legal corpora ☆59 Updated 3 years ago
- spaCy entry points for Curated Transformers ☆32 Updated 7 months ago
- A simple library for segmenting legal texts ☆17 Updated 2 years ago
- Generate reports for spaCy models. ☆29 Updated 3 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆21 Updated last year
- This is a prototype of a multi-lingual suite for named-entity recognition in Python. ☆21 Updated last year
- Source code and data for Like a Good Nearest Neighbor ☆30 Updated last year
- ☆30 Updated 3 years ago
- ☆68 Updated 3 years ago
- Code for SaGe subword tokenizer (EACL 2023) ☆27 Updated last year
- ☆70 Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆169 Updated 3 years ago
- Python-based Wikidata framework for easy dataframe extraction ☆45 Updated 2 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆26 Updated 3 years ago
- It's a cooler way to store simple linear models. ☆27 Updated last year
- ☆20 Updated 4 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP tasks with legal texts. ☆40 Updated 6 years ago
- Rust Python bindings for symspell ☆21 Updated 2 years ago
- ☆67 Updated last year
- My NER Experiments with ModernBERT and Ettin ☆26 Updated 5 months ago
- Mining Legal Arguments in Court Decisions - Data and software ☆73 Updated 2 years ago
- Python package for deduplication/entity resolution using active learning ☆83 Updated last year