A word2vec negative sampling implementation with correct CBOW update.
☆261Nov 8, 2021Updated 4 years ago
Alternatives and similar repositories for koan
Users that are interested in koan are comparing it to the libraries listed below
Sorting:
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- ☆13Aug 13, 2020Updated 5 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- ☆15Dec 20, 2020Updated 5 years ago
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆320Mar 1, 2024Updated 2 years ago
- code and data for paper "One-shot Text Field Labeling using Attention and BeliefPropagation for Structure Information Extraction"☆61Aug 9, 2020Updated 5 years ago
- Compute Sentence Embeddings Fast!☆624Mar 2, 2023Updated 3 years ago
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,108Nov 14, 2024Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- Code release for "A Time-Aware Transformer Based Model for Suicide Ideation Detection on Social Media", EMNLP 2020.☆54Nov 16, 2020Updated 5 years ago
- Domain-specific BERT representation for Named Entity Recognition of lab protocol☆29Dec 25, 2020Updated 5 years ago
- spaCy + UDPipe☆166Apr 19, 2022Updated 3 years ago
- Explainable Zero-Shot Topic Extraction☆65Aug 19, 2024Updated last year
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,265Jul 24, 2025Updated 7 months ago
- ANYKS Spell-Checker☆19Jan 3, 2023Updated 3 years ago
- ☆17Sep 22, 2020Updated 5 years ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data…☆243May 12, 2024Updated last year
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,083Aug 15, 2024Updated last year
- A fast, efficient universal vector embedding utility package.☆1,655Aug 3, 2023Updated 2 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 5 years ago
- SummVis is an interactive visualization tool for text summarization.☆254Jun 17, 2022Updated 3 years ago
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…☆67Oct 13, 2020Updated 5 years ago
- Active Learning for Text Classification in Python☆639Feb 1, 2026Updated last month
- DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks☆1,267Mar 2, 2023Updated 3 years ago
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆790Jul 22, 2025Updated 7 months ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago
- Information extraction from English and German texts based on predicate logic☆394Jul 8, 2022Updated 3 years ago
- linear-time dynamic programming dependency parser☆11Feb 2, 2019Updated 7 years ago
- ☆19Sep 16, 2025Updated 5 months ago
- ☆13Apr 16, 2021Updated 4 years ago
- On Generating Extended Summaries of Long Documents☆78Jan 26, 2021Updated 5 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆55Dec 2, 2021Updated 4 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Sep 22, 2025Updated 5 months ago
- A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural …☆2,933Nov 7, 2022Updated 3 years ago
- A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Ope…☆1,573Feb 15, 2023Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,221Oct 1, 2024Updated last year