projecte-aina / spacy
Pre-production releases for Spacy in Catalan
☆14Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for spacy
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆13Updated 4 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆27Updated 3 years ago
- Topic modelling with SpaCy, Gensim and Textacy☆19Updated 6 years ago
- ☆22Updated last year
- ☆53Updated 10 months ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated last year
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Finds linguistic patterns effortlessly☆33Updated last year
- ☆29Updated 2 years ago
- Topic supervised non-negative matrix factorization with sparse matrices☆12Updated 4 years ago
- Labeled segmentation for the document structure of printed books☆13Updated 7 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆11Updated 4 months ago
- ☆18Updated 9 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- Provides the implementation of a topic detection framework developed for the MULTISENSOR project.☆9Updated 8 years ago
- CoreNLG is an easy to use and productivity oriented Python library for Natural Language Generation. It aims to provide the essential tool…☆27Updated 3 years ago
- Tools for interactive visual exploration of semantic embeddings.☆29Updated 2 months ago
- 🌸 Train floret vectors☆18Updated last year
- Tool for sentiment analysis annotation☆11Updated last month
- Python based Wikidata framework for easy dataframe extraction☆39Updated 11 months ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆40Updated last year
- Wayward is a Python package that helps to identify characteristic terms from single documents or groups of documents. It can be used for …☆9Updated 5 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated 8 months ago
- Calculates the word error rate of two strings, and the result is written into beautify HTML.☆20Updated 4 years ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 2 years ago
- Text preprocessing tools in python.☆26Updated 6 years ago