insperatum / pregexLinks
Probabilistic regular expressions
☆18Updated 6 years ago
Alternatives and similar repositories for pregex
Users that are interested in pregex are comparing it to the libraries listed below
Sorting:
- Matrix tools for building and inspecting latent spaces☆27Updated 6 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- Finds linguistic patterns effortlessly☆36Updated last year
- KenLM extension for spaCy 2.0.☆16Updated 7 years ago
- ☆17Updated last year
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Updated 7 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- Vectorizers for a range of different data types☆101Updated 4 months ago
- Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)☆18Updated last year
- A reference implementation of algorithms for distributions over spanning trees.☆21Updated 5 years ago
- Pipeline components that support partial_fit.☆46Updated 10 months ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Updated 4 years ago
- Bag of, not words, but tricks!☆68Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆99Updated last year
- Tokenize and clean strings in Python☆12Updated 7 years ago
- Bayesian Assessment of Hypotheses☆24Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 11 months ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 6 months ago
- A pedagogical, functional-oriented deep learning library built on top of jax.☆15Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Repo contains Jupyter notebooks compiled during my review of the programming books listed.☆13Updated 3 years ago
- The ntentional blog - a machine learning journey☆23Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆28Updated 3 years ago
- Align the token outputs from Spacy and Huggingface to help understand what language structures transformers see☆44Updated 3 years ago
- ☆30Updated 3 years ago
- A list of resources dedicated to compositionality☆14Updated 6 years ago