shriphani / polyglot-toolboxLinks
Polyglot skipgram embeddings, and their many health benefits
β12Updated 6 years ago
Alternatives and similar repositories for polyglot-toolbox
Users that are interested in polyglot-toolbox are comparing it to the libraries listed below
Sorting:
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddingsβ77Updated 3 years ago
- ππ A browser extension that displays the GPT-2 Log Probability of selected textβ112Updated 2 years ago
- A word2vec negative sampling implementation with correct CBOW update.β261Updated 4 years ago
- This blog post visualize vector norms of FastText embedding and evaluates use of FastText word vector norm multiplied with number of wordβ¦β19Updated 2 years ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processingβ113Updated last year
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search applβ¦β66Updated last year
- β19Updated 5 years ago
- Automatically labeling training dataβ107Updated 7 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learningβ43Updated 5 years ago
- Natural Language Data Augmentation Tool for Conversational Systemsβ115Updated 3 years ago
- β123Updated 2 years ago
- Representing research papers as vectors / latent representations.β198Updated 4 years ago
- β98Updated 5 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classificationβ29Updated last year
- Fast and customizable tokenizationβ67Updated 6 years ago
- McKernel: A Library for Approximate Kernel Expansions in Log-linear Time.β13Updated 3 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methodsβ96Updated 5 years ago
- Discover relevant information about categorical data with entity embeddings using Neural Networks (powered by Keras)β70Updated 3 years ago
- β30Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.β21Updated 2 years ago
- β70Updated 3 years ago
- The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniquesβ30Updated 5 years ago
- Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.β151Updated 6 years ago
- Ensemble topic modelling with pLSAβ114Updated 4 years ago
- A clean and easy interface for performing nearest-neighbor lookupsβ50Updated 6 years ago
- Example using Polyaxon to experiment with pre-training spaCyβ65Updated 4 years ago
- A spell checker built from GloVe word vectorsβ81Updated 7 years ago
- Similarity search on Wikipedia using gensim in Python.β60Updated 7 years ago
- An author identification system based on recurβ21Updated 9 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables"β24Updated 6 years ago