bloomberg / koan
A word2vec negative sampling implementation with correct CBOW update.
β261Updated 3 years ago
Alternatives and similar repositories for koan:
Users that are interested in koan are comparing it to the libraries listed below
- πΈ fastText + Bloom embeddings for compact, full-coverage vectors with spaCyβ307Updated last year
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddingsβ76Updated 2 years ago
- Create interactive textual heat maps for Jupiter notebooksβ196Updated 8 months ago
- More interactive weak supervision with FlyingSquidβ315Updated 4 years ago
- Self-Supervision for Named Entity Disambiguation at the Tailβ215Updated 2 years ago
- Labelling platform for text using weak supervision.β260Updated 2 years ago
- Tool for interactive embeddings visualizationβ303Updated 6 months ago
- skweak: A software toolkit for weak supervision applied to NLP tasksβ923Updated 5 months ago
- SpikeX - SpaCy Pipes for Knowledge Extractionβ397Updated 3 years ago
- spaCy + UDPipeβ160Updated 2 years ago
- Flexible classic and NeurAl Retrieval Toolkitβ215Updated last week
- Fuzzy matching and more functionality for spaCy.β254Updated 7 months ago
- A library to synthesize text datasets using Large Language Models (LLM)β151Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.β145Updated 3 years ago
- Live Python Notebooks with any Editorβ277Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated 11 months ago
- LASER multilingual sentence embeddings as a pip packageβ224Updated last year
- Misspelling Oblivious Word Embeddingsβ203Updated 5 years ago
- π°Natural language processing (NLP) newsletterβ301Updated 4 years ago
- Google USE (Universal Sentence Encoder) for spaCyβ182Updated last year
- Docsβ143Updated 2 months ago
- SummVis is an interactive visualization tool for text summarization.β252Updated 2 years ago
- Deep learning with text doesn't have to be scary.β276Updated 2 years ago
- Question-answers, collected from Googleβ126Updated 3 years ago
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Pythonβ181Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.β126Updated 4 years ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NNβ117Updated 3 years ago
- Visualising the Transformer encoderβ111Updated 4 years ago
- Machine Learning for Information Retrievalβ85Updated last week
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!β473Updated 2 years ago