lintseju / word_embedding
Sample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding.
β27Updated 11 months ago
Alternatives and similar repositories for word_embedding:
Users that are interested in word_embedding are comparing it to the libraries listed below
- Fine tuning bert for text generationβ37Updated 5 years ago
- π³ NLPrep - dataset tool for many natural language processing taskβ28Updated 3 years ago
- βοΈTool for NLP - handle file and textβ15Updated last month
- π€π handling multiple nlp task in one pipelineβ56Updated last year
- β96Updated 6 years ago
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Datasetβ311Updated 4 years ago
- ε°εQAεηζ©ε¨δΊΊ(δ½Ώη¨BERTγALBERT)β42Updated 4 years ago
- PyTorch Implementation of NBA game summary generator.β81Updated 2 years ago
- (WIP) My humble contribution to the democratization of the Chinese NLP technologyβ46Updated 5 years ago
- β92Updated 4 months ago
- Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816β41Updated 3 years ago
- β9Updated 10 years ago
- Keyphrase Extraction based on Scientific Text, Semeval 2017, Task 10β108Updated 2 years ago
- Tutorial for Chinese Sentiment analysis with hotel review dataβ48Updated 7 years ago
- COS960: A Chinese Word Similarity Dataset of 960 Word Pairsβ35Updated 5 years ago
- A web crawler specifically for PTT website.β19Updated 6 years ago
- A short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)β154Updated 4 years ago
- π hosting nlp models in one lineβ20Updated 10 months ago
- Phraseg - δΈθ¨οΌζ°θ©ηΌηΎε·₯ε ·εβ26Updated 3 years ago
- Berserker - BERt chineSE woRd toKenizERβ16Updated 6 years ago
- β38Updated 5 years ago
- Normalize text stringβ12Updated 6 years ago
- Official codes for the paper: A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss.β124Updated 5 years ago
- A list of awesome machine question answering dataset - ζ©ε¨εηζΈζιβ15Updated 5 years ago
- Position embedding layers in Kerasβ58Updated 3 years ago
- β78Updated 5 years ago
- seq2seq based keyphrase generation model sets, including copyrnn copycnn and copytransfomerβ50Updated 3 years ago
- Deep Keyphrase Extraction using BERTβ258Updated 3 years ago
- [ACM-WSDM] 3rd place solution at WSDM Cup 2019, Fake News Classification on Kaggle.β63Updated 5 years ago
- This is a chinese Bert model specific for question answeringβ26Updated 5 years ago