ngrams-dev / generalLinks
NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and issues.
☆22Updated last month
Alternatives and similar repositories for general
Users that are interested in general are comparing it to the libraries listed below
Sorting:
- ☆55Updated 2 years ago
- Powerful topic model visualization in Python☆141Updated 10 months ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆36Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆58Updated 4 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆134Updated last year
- RosaeNLG is a Natural Language Generation library for node.js and browser rendering, based on the Pug template engine.☆106Updated last year
- spaCy entry points for Curated Transformers☆32Updated 8 months ago
- spaCy REST API, wrapped in a Docker container.☆16Updated 4 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆184Updated 8 months ago
- Suite of generic Linked Data/SPARQL as well as LinkedDataHub-specific MCP tools☆34Updated last week
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆96Updated 2 years ago
- Robust and fast topic models with sentence-transformers.☆89Updated this week
- 🦦 weasel: A small and easy workflow system☆90Updated 2 months ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆86Updated 2 weeks ago
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://…☆37Updated last month
- 🔢 Work with static vector models☆36Updated 9 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆170Updated 3 years ago
- Blazing fast topic modelling for short texts.☆34Updated last month
- Stylometry library for Burrows' Delta method☆46Updated 6 months ago
- Tools for interactive visual exploration of semantic embeddings.☆42Updated last year
- Python Multilingual Ucrel Semantic Analysis System☆35Updated 2 weeks ago
- A simple toolkit for conducting analyses using corpus methods☆27Updated 4 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- A machine learning tool for fishing entities☆270Updated 8 months ago
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆105Updated last year
- A spaCy wrapper for GliNER☆129Updated last year
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆71Updated 4 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆81Updated 2 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆358Updated 4 years ago