jmyrberg / finnlem
Neural-network-based lemmatizer for the Finnish language
☆11 · Updated 4 years ago
Related projects:
- Anonymization of legal cases (Fr) based on Flair embeddings ☆87 · Updated 3 years ago
- Dataframe Integration with spaCy. ☆100 · Updated 3 years ago
- 🚀GUI for training spaCy models ☆53 · Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 · Updated last year
- Calculate readability scores ☆40 · Updated 5 years ago
- Language detection using Spacy and Fasttext ☆53 · Updated 9 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆90 · Updated last year
- ☆53 · Updated 8 months ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴) ☆15 · Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆151 · Updated last year
- Featurize words into orthographic and phonological vectors. ☆39 · Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers ☆35 · Updated 2 years ago
- German lemmatization with IWNLP as extension for spaCy ☆23 · Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 · Updated 2 years ago
- Notebooks configured to be run with Binder, usually found on my blog. ☆41 · Updated last year
- ☆32 · Updated 6 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 · Updated 5 months ago
- German sentiment scores with SentiWS as extension for spaCy ☆36 · Updated last year
- spaCy + UDPipe ☆159 · Updated 2 years ago
- Clean personally identifiable information from dirty dirty text using spaCy. ☆40 · Updated last year
- A fully customisable language detection pipeline for spaCy ☆93 · Updated 5 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 · Updated 6 months ago
- Language Tool style grammar handling with spaCy 2.0 ☆42 · Updated 6 years ago
- Simple perceptron tagger trained using the NLTK on the NLCOW14 corpus. ☆25 · Updated 6 years ago
- linguistics backend ☆40 · Updated last year
- ☆24 · Updated 4 years ago
- ☆65 · Updated 2 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing. ☆19 · Updated last year
- The project proposes a framework to apply topic models on a text-corpus and eventually topic labels on the generated topics. ☆36 · Updated 4 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆83 · Updated last year