bloomberg / koanLinks
A word2vec negative sampling implementation with correct CBOW update.
β261Updated 3 years ago
Alternatives and similar repositories for koan
Users that are interested in koan are comparing it to the libraries listed below
Sorting:
- More interactive weak supervision with FlyingSquidβ315Updated 4 years ago
- πΈ fastText + Bloom embeddings for compact, full-coverage vectors with spaCyβ314Updated 2 months ago
- Create interactive textual heat maps for Jupiter notebooksβ196Updated last year
- Misspelling Oblivious Word Embeddingsβ201Updated 5 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddingsβ77Updated 2 years ago
- spaCy + UDPipeβ161Updated 3 years ago
- Self-Supervision for Named Entity Disambiguation at the Tailβ218Updated 3 years ago
- π Easy training and deployment of seq2seq models.β228Updated 4 years ago
- Labelling platform for text using weak supervision.β263Updated 3 years ago
- Tool for interactive embeddings visualizationβ314Updated 10 months ago
- An ML framework to accelerate research and its path to production.β266Updated 10 months ago
- LASER multilingual sentence embeddings as a pip packageβ224Updated last year
- Deep learning with text doesn't have to be scary.β276Updated 2 years ago
- SpikeX - SpaCy Pipes for Knowledge Extractionβ398Updated 3 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERTβ231Updated 2 years ago
- xfspell β the Transformer Spell Checkerβ190Updated 5 years ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NNβ117Updated 4 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasksβ926Updated 10 months ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.β127Updated 4 years ago
- Neural Searchβ332Updated last year
- π°Natural language processing (NLP) newsletterβ301Updated 4 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/β192Updated last year
- Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/β246Updated last year
- SummVis is an interactive visualization tool for text summarization.β253Updated 3 years ago
- Unsupervised Language Model Pre-training for Frenchβ248Updated 2 years ago
- PYthon Automated Term Extractionβ314Updated 2 years ago
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.β55Updated 3 years ago
- Jupyter Widget for data annotationβ140Updated 2 years ago
- The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761β283Updated 5 years ago
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.β254Updated 2 years ago