bloomberg / koan
A word2vec negative sampling implementation with correct CBOW update.
☆261 · Updated 4 years ago
Alternatives and similar repositories for koan
Users interested in koan are comparing it to the libraries listed below.
- More interactive weak supervision with FlyingSquid ☆316 · Updated 5 years ago
- fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆327 · Updated 7 months ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings ☆77 · Updated 3 years ago
- Create interactive textual heat maps for Jupyter notebooks ☆196 · Updated last year
- An ML framework to accelerate research and its path to production. ☆268 · Updated last year
- Deep learning with text doesn't have to be scary. ☆275 · Updated 2 years ago
- spaCy + UDPipe ☆163 · Updated 3 years ago
- Misspelling Oblivious Word Embeddings ☆201 · Updated 6 years ago
- Self-Supervision for Named Entity Disambiguation at the Tail ☆218 · Updated 3 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ☆42 · Updated 5 years ago
- Tool for interactive embeddings visualization ☆323 · Updated last year
- Easy training and deployment of seq2seq models. ☆228 · Updated 4 years ago
- SummVis is an interactive visualization tool for text summarization. ☆253 · Updated 3 years ago
- Labelling platform for text using weak supervision. ☆262 · Updated 3 years ago
- Custom Natural Language Processing with big and small models ☆66 · Updated 4 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT ☆234 · Updated 2 years ago
- Topic Inference with Zeroshot models ☆61 · Updated 2 years ago
- LASER multilingual sentence embeddings as a pip package ☆225 · Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 · Updated 2 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆400 · Updated 4 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ☆127 · Updated 5 years ago
- The Python library with command line tools to interact with Dynabench (https://dynabench.org/), such as uploading models. ☆55 · Updated 3 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ☆27 · Updated 3 years ago
- Jupyter Widget for data annotation ☆141 · Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆87 · Updated 2 months ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/ ☆193 · Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… ☆108 · Updated last year
- Sentence transformers models for SpaCy ☆109 · Updated 2 years ago
- Natural language processing (NLP) newsletter ☆303 · Updated 5 years ago
- The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761 ☆282 · Updated 5 years ago