insperatum / pregex
Probabilistic regular expressions
☆18 Updated 5 years ago
Related projects:
- Pipeline components that support partial_fit. ☆42 Updated 2 months ago
- Bag of, not words, but tricks! ☆68 Updated 10 months ago
- Finds linguistic patterns effortlessly ☆31 Updated last year
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms ☆14 Updated 2 years ago
- ☆17 Updated last year
- A Python library for creating adversarial splits ☆13 Updated 2 years ago
- Vectorizers for a range of different data types ☆92 Updated 3 weeks ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆53 Updated last year
- A python package to simulate typographical errors. ☆30 Updated 9 months ago
- 🧪 Cutting-edge experimental spaCy components and features ☆94 Updated 4 months ago
- Repo contains Jupyter notebooks compiled during my review of the programming books listed. ☆13 Updated 2 years ago
- KEN: Relational Data Embeddings ☆27 Updated 8 months ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec ☆19 Updated last year
- Rich Context leaderboard competition, including the corpus and current SOTA for required tasks. ☆21 Updated 3 years ago
- Repository for my master thesis on automated string handling ☆16 Updated 3 years ago
- Prune your sklearn models ☆19 Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 Updated 2 years ago
- SciWING is a modern toolkit for scientific document processing from WING-NUS ☆61 Updated last year
- spaCy match and replace, maintaining conjugation ☆34 Updated last year
- ☆65 Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆56 Updated last year
- Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar… ☆26 Updated 8 months ago
- Converter from UD-trees to BART representation ☆37 Updated 6 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆72 Updated 2 months ago
- A reference implementation of algorithms for distributions over spanning trees. ☆21 Updated 4 years ago
- Document parameters using comments ☆10 Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library. ☆32 Updated last year
- ☆38 Updated this week
- Agents that build knowledge graphs and explore textual worlds by asking questions ☆76 Updated last year
- A collection of utilities for writing labeling functions, transformation functions, and slicing functions. ☆20 Updated 4 years ago