bloomberg / koan
A word2vec negative sampling implementation with correct CBOW update.
★261 · Updated 3 years ago
Alternatives and similar repositories for koan
Users who are interested in koan are comparing it to the libraries listed below.
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ★313 · Updated last month
- Create interactive textual heat maps for Jupyter notebooks ★196 · Updated last year
- Self-Supervision for Named Entity Disambiguation at the Tail ★218 · Updated 3 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings ★77 · Updated 2 years ago
- Misspelling Oblivious Word Embeddings ★201 · Updated 5 years ago
- More interactive weak supervision with FlyingSquid ★315 · Updated 4 years ago
- 📰 Natural language processing (NLP) newsletter ★301 · Updated 4 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ★398 · Updated 3 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/ ★192 · Updated last year
- SummVis is an interactive visualization tool for text summarization. ★253 · Updated 3 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT ★231 · Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks ★926 · Updated 9 months ago
- Deep learning with text doesn't have to be scary. ★276 · Updated 2 years ago
- Tool for interactive embeddings visualization ★313 · Updated 10 months ago
- Labelling platform for text using weak supervision. ★262 · Updated 2 years ago
- spaCy + UDPipe ★161 · Updated 3 years ago
- LASER multilingual sentence embeddings as a pip package ★224 · Updated last year
- Google USE (Universal Sentence Encoder) for spaCy ★184 · Updated 2 years ago
- PYthon Automated Term Extraction ★313 · Updated 2 years ago
- Live Python Notebooks with any Editor ★280 · Updated 2 years ago
- The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761 ★283 · Updated 5 years ago
- Jupyter Widget for data annotation ★139 · Updated 2 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ★362 · Updated 3 years ago
- Flexible classic and NeurAl Retrieval Toolkit ★217 · Updated 4 months ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN ★117 · Updated 4 years ago
- Easy training and deployment of seq2seq models. ★228 · Updated 4 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ★106 · Updated last year
- Question-answers, collected from Google ★129 · Updated 3 years ago
- xfspell - the Transformer Spell Checker ★190 · Updated 5 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks ★195 · Updated 2 years ago