arnicas / nlp-tips-and-tricksLinks
Repo originally for a talk at Normconf
☆21Updated 2 years ago
Alternatives and similar repositories for nlp-tips-and-tricks
Users that are interested in nlp-tips-and-tricks are comparing it to the libraries listed below
Sorting:
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆100Updated last year
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 2 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆164Updated 2 years ago
- spaCy + UDPipe☆162Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- Bag of, not words, but tricks!☆68Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- PYthon Automated Term Extraction☆315Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- Fast computation of Krippendorff's alpha agreement measure in Python.☆148Updated 6 months ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29Updated 5 years ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆97Updated 7 months ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆139Updated 2 years ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆82Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆256Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆94Updated last year
- Compass-aligned Distributional Embeddings. Align embeddings from different corpora☆40Updated 2 years ago
- Information extraction from English and German texts based on predicate logic☆138Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆125Updated last year
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆317Updated 3 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆183Updated 3 weeks ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆147Updated 7 months ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Updated 6 years ago