projecte-aina / spacy
Pre-production releases for Spacy in Catalan
☆14Updated 3 years ago
Alternatives and similar repositories for spacy:
Users that are interested in spacy are comparing it to the libraries listed below
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆10Updated last year
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- ☆30Updated 2 years ago
- Provides the implementation of a topic detection framework developed for the MULTISENSOR project.☆9Updated 8 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Labeled segmentation for the document structure of printed books☆13Updated 7 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago
- Finds linguistic patterns effortlessly☆35Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- 🌸 Train floret vectors☆18Updated last year
- Tool for sentiment analysis annotation☆12Updated 4 months ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- ☆17Updated 6 months ago
- ☆23Updated 2 years ago
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)☆12Updated 3 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- ☆20Updated 3 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆40Updated last year
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- ☆64Updated 2 years ago
- The RadioTalk dataset of talk radio transcripts☆57Updated 4 years ago
- MinHash implementation in Python☆11Updated 5 months ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated 11 months ago
- List of corpora annotated for coreference for different languages☆17Updated 6 months ago
- extract relationships from standardized terms from corpus of interest with deep learning☆20Updated 5 years ago