shriphani / polyglot-toolbox
Polyglot skipgram embeddings, and their many health benefits
☆12Updated 5 years ago
Alternatives and similar repositories for polyglot-toolbox
Users who are interested in polyglot-toolbox are comparing it to the libraries listed below:
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆77Updated 3 years ago
- A word2vec negative sampling implementation with correct CBOW update.☆260Updated 3 years ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processing☆112Updated last year
- Hyperbolic (Poincare, Lorentz) Embeddings for TensorFlow☆55Updated 6 years ago
- Representing research papers as vectors / latent representations.☆198Updated 4 years ago
- 📝🔍 A browser extension that displays the GPT-2 Log Probability of selected text☆112Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆42Updated 5 years ago
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl…☆65Updated last year
- McKernel: A Library for Approximate Kernel Expansions in Log-linear Time.☆13Updated 2 years ago
- A spell checker built from GloVe word vectors☆81Updated 7 years ago
- Discover relevant information about categorical data with entity embeddings using Neural Networks (powered by Keras)☆70Updated 2 years ago
- Automatically labeling training data☆107Updated 6 years ago
- ☆123Updated 2 years ago
- This blog post visualizes vector norms of FastText embeddings and evaluates the use of FastText word vector norm multiplied with number of word…☆19Updated 2 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods☆95Updated 4 years ago
- Ensemble topic modelling with pLSA☆115Updated 3 years ago
- Natural Language Data Augmentation Tool for Conversational Systems☆115Updated 2 years ago
- Find strings/words in text; convenience and C speed☆127Updated 2 years ago
- Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.☆149Updated 5 years ago
- ULMFiT + Siamese Network for Sentence Vectors☆33Updated 6 years ago
- Fast and customizable tokenization☆66Updated 6 years ago
- Tool for interactive embeddings visualization☆315Updated last year
- A clean and easy interface for performing nearest-neighbor lookups☆50Updated 5 years ago
- Similarity search on Wikipedia using gensim in Python.☆60Updated 6 years ago
- Twitterbot that uses machine learning to curate interesting arXiv papers☆43Updated 7 years ago
- More interactive weak supervision with FlyingSquid☆315Updated 5 years ago
- Notebooks and data associated to constructing and exploring a map of subreddits.☆55Updated 8 years ago
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago
- Implementation of GloVe in Keras☆45Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago