lintseju / word_embedding
Sample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding.
β27Updated 9 months ago
Alternatives and similar repositories for word_embedding:
Users that are interested in word_embedding are comparing it to the libraries listed below
- Fine tuning bert for text generationβ37Updated 5 years ago
- π³ NLPrep - dataset tool for many natural language processing taskβ28Updated 3 years ago
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Datasetβ306Updated 4 years ago
- π€π handling multiple nlp task in one pipelineβ56Updated last year
- βοΈTool for NLP - handle file and textβ15Updated 6 months ago
- Keyphrase Extraction based on Scientific Text, Semeval 2017, Task 10β108Updated 2 years ago
- Tutorial for Chinese Sentiment analysis with hotel review dataβ45Updated 7 years ago
- (WIP) My humble contribution to the democratization of the Chinese NLP technologyβ46Updated 5 years ago
- BERT CRF model for Name Entity Recognition in pytorchβ29Updated last year
- β97Updated 5 years ago
- PyTorch Implementation of NBA game summary generator.β82Updated 2 years ago
- ε°εQAεηζ©ε¨δΊΊ(δ½Ώη¨BERTγALBERT)β41Updated 4 years ago
- Multi-Grained Named Entity Recognition (ACL 2019)β35Updated 5 years ago
- Code and data for paper "Dialog Intent Induction with Deep Multi-View Clustering", Hugh Perkins and Yi Yang, 2019, EMNLP 2019β66Updated last year
- COS960: A Chinese Word Similarity Dataset of 960 Word Pairsβ35Updated 5 years ago
- TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)β172Updated 2 years ago
- This repo contains a PyTorch implementation of a pretrained BERT model for sentence similarity task.β48Updated 5 years ago
- β78Updated 5 years ago
- Position embedding layers in Kerasβ58Updated 2 years ago
- Easily generate document/paragraph/sentence vectors and calculate similarity.β137Updated 3 years ago
- This model base on bert-as-service. Model structure : bert-embedding bilstm crf.β37Updated 6 years ago
- The enhanced version of ZEN, larger and more powerful.β28Updated 2 years ago
- Heterogeneous Representations for Neural Relation Extraction https://arxiv.org/abs/1903.10126β69Updated 4 years ago
- β9Updated 10 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentationβ53Updated 5 years ago
- Rank-based Unsupervised Keyword Extraction via Metavertex Aggregationβ99Updated 2 months ago
- β39Updated 2 years ago
- Experiment on NER task using Huggingface state-of-the-art Transformers Natural Language Models libraryβ40Updated last year
- θ¨η·΄δΈζθ©ει Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.β58Updated last year