bloomberg / koanLinks
A word2vec negative sampling implementation with correct CBOW update.
☆260Updated 3 years ago
Alternatives and similar repositories for koan
Users that are interested in koan are comparing it to the libraries listed below
Sorting:
- More interactive weak supervision with FlyingSquid☆316Updated 5 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆77Updated 3 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆324Updated 6 months ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated last year
- Misspelling Oblivious Word Embeddings☆201Updated 6 years ago
- Tool for interactive embeddings visualization☆318Updated last year
- 🏖 Easy training and deployment of seq2seq models.☆227Updated 4 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆42Updated 5 years ago
- Self-Supervision for Named Entity Disambiguation at the Tail☆218Updated 3 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- An ML framework to accelerate research and its path to production.☆267Updated last year
- LASER multilingual sentence embeddings as a pip package☆225Updated 2 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆234Updated 2 years ago
- 📰Natural language processing (NLP) newsletter☆302Updated 5 years ago
- spaCy + UDPipe☆163Updated 3 years ago
- Deep learning with text doesn't have to be scary.☆275Updated 2 years ago
- Unsupervised Language Model Pre-training for French☆247Updated 2 years ago
- Labelling platform for text using weak supervision.☆262Updated 3 years ago
- SummVis is an interactive visualization tool for text summarization.☆253Updated 3 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆193Updated 2 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆399Updated 4 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- xfspell — the Transformer Spell Checker☆189Updated 5 years ago
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python☆181Updated last year
- The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761☆282Updated 5 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆66Updated 4 years ago
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.☆55Updated 3 years ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- Polish BERT☆72Updated 4 years ago
- Self-training with Weak Supervision (NAACL 2021)☆161Updated 2 years ago