ngrams-dev / generalLinks
NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and issues.
☆22Updated last month
Alternatives and similar repositories for general
Users that are interested in general are comparing it to the libraries listed below
Sorting:
- A sentence segmentation library with wide language support optimized for speed and utility.☆86Updated 2 weeks ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆184Updated 8 months ago
- Blazing fast topic modelling for short texts.☆34Updated last month
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆58Updated 4 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆36Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆134Updated last year
- Powerful topic model visualization in Python☆141Updated 10 months ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆127Updated last year
- Gather modern English word frequencies from all enwiki articles.☆228Updated last year
- A Flexible Deep Learning Approach to Fuzzy String Matching☆150Updated last year
- ☆55Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆170Updated 3 years ago
- Multilingual syllable annotation pipeline component for spacy☆39Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy☆113Updated 8 months ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆71Updated 4 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆22Updated last month
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://…☆37Updated last month
- Verb forms dictionary☆70Updated 8 years ago
- spaCy entry points for Curated Transformers☆32Updated 8 months ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 6 years ago
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆80Updated 2 years ago
- A TextBlob sentiment analysis pipeline component for spaCy.☆57Updated 3 months ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Updated 5 months ago
- A machine learning tool for fishing entities☆270Updated 8 months ago
- A spaCy wrapper for DBpedia Spotlight☆113Updated 2 years ago
- Preliminary spaCy models for Latin☆14Updated 3 years ago
- Gutenberg cache and query library☆46Updated 2 months ago
- Discourse Analysis Tool Suite☆41Updated this week
- Machine-readable lists of lemma-token pairs in 23 languages.☆358Updated 4 years ago
- an interactive visual tool for exploring ideologies of political parties from up to date WikiData, using SPARQL, D3js, and PixiJS☆17Updated 4 years ago