bloomberg / koanLinks
A word2vec negative sampling implementation with correct CBOW update.
☆260Updated 3 years ago
Alternatives and similar repositories for koan
Users that are interested in koan are comparing it to the libraries listed below
Sorting:
- More interactive weak supervision with FlyingSquid☆315Updated 4 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆317Updated 3 months ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆77Updated 3 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated last year
- Misspelling Oblivious Word Embeddings☆201Updated 5 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 5 years ago
- An ML framework to accelerate research and its path to production.☆266Updated 10 months ago
- Tool for interactive embeddings visualization☆314Updated 11 months ago
- spaCy + UDPipe☆162Updated 3 years ago
- Podium: a framework agnostic Python NLP library for data loading and preprocessing☆60Updated 2 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- SummVis is an interactive visualization tool for text summarization.☆253Updated 3 years ago
- Self-Supervision for Named Entity Disambiguation at the Tail☆219Updated 3 years ago
- Deep learning with text doesn't have to be scary.☆276Updated 2 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆233Updated 2 years ago
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.☆55Updated 3 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆192Updated last year
- Jupyter Widget for data annotation☆140Updated 2 years ago
- Camphr - NLP libary for creating pipeline components☆339Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- xfspell — the Transformer Spell Checker☆190Updated 5 years ago
- 🏖 Easy training and deployment of seq2seq models.☆228Updated 4 years ago
- Unsupervised Language Model Pre-training for French☆248Updated 2 years ago
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python☆181Updated last year
- NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks☆72Updated 2 years ago
- NER, syntax markup visualizations☆139Updated 2 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆117Updated 4 years ago