mheilman / tan-clusteringLinks

Hierarchical word clustering, following "Brown clustering" (Brown et al., 1992)

☆70

Alternatives and similar repositories for tan-clustering

Users that are interested in tan-clustering are comparing it to the libraries listed below

Sorting:

bplank / bilstm-aux
Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings)
☆123Updated 2 years ago
rguthrie3 / MorphologicalPriorsForWordEmbeddings
Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings
☆53Updated 8 years ago
NLPrinceton / text_embedding
utility class for building/evaluating document representations
☆53Updated 5 years ago
alvations / stasis
Semantic Textual Similarity in Python
☆80Updated 8 years ago
fh295 / SentenceRepresentation
☆125Updated 8 years ago
mfaruqui / non-distributional
Non-distributional linguistic word vector representations.
☆62Updated 8 years ago
mfaruqui / eval-word-vectors
Easy to use scripts for evaluating word vectors on a variety of tasks.
☆119Updated 4 years ago
jwieting / charagram
Code to train and use models from "Charagram: Embedding Words and Sentences via Character n-grams".
☆124Updated 9 years ago
minimalparts / nonce2vec
Incremental learning of word embeddings with context informativeness.
☆94Updated 2 years ago
JonathanRaiman / wikipedia_ner
Labeled examples from wiki dumps in Python
☆67Updated 9 years ago
ikekonglp / TweeboParser
A Dependency Parser for Tweets
☆78Updated 6 years ago
artetxem / uncovec
Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation
☆63Updated 7 years ago
dmcc / PyStanfordDependencies
Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies
☆69Updated 6 years ago
pdasigi / onto-lstm
Keras implementation of ontology aware token embeddings
☆49Updated 7 years ago
uhh-lt / sensegram
Making sense embedding out of word embeddings using graph-based word sense induction
☆213Updated 4 years ago
stanfordnlp / stanza-old
Stanford NLP group's shared Python tools.
☆136Updated 7 years ago
jiyfeng / entitynlm
☆44Updated 8 years ago
ikekonglp / PAD
☆44Updated 10 years ago
cdg720 / emnlp2016
☆47Updated 8 years ago
sean-chester / generalised-brown
C++ implementation of Generalised Brown clustering and python scripts for feature generation
☆41Updated 9 years ago
jonsafari / clustercat
Fast Word Clustering Software
☆79Updated 9 months ago
ytsvetko / qvec
Intrinsic evaluation of word vectors
☆76Updated 7 years ago
swabhs / joint-lstm-parser
Transition-based joint syntactic dependency parser and semantic role labeler using a stack LSTM RNN architecture.
☆61Updated 8 years ago
karlmoritz / bicvm
BiCVM Code
☆45Updated 7 years ago
yotam-happy / NEDforNoisyText
Named Entity Disambiguation for Noisy Text
☆66Updated 8 years ago
vered1986 / LexNET
LexNET: Integrated Path-based and Distributional Method for Lexical Semantic Relation Classification
☆62Updated 7 years ago
microth / PathLSTM
Neural SRL model
☆71Updated 3 years ago
ucam-smt / sgnmt
Decoding platform for machine translation research
☆54Updated 6 years ago
PrincetonML / SemanticVector
Word embedding approach based on a dynamic log-linear model
☆55Updated 8 years ago
gouwsmeister / bilbowa
Open-source implementation of the BilBOWA (Bilingual Bag-of-Words without Alignments) word embedding model.
☆69Updated 4 years ago