lintseju / word_embedding
Sample code for training Word2Vec and FastText on a wiki corpus, along with their pretrained word embeddings.
☆27Updated 6 months ago
Related projects
Alternatives and complementary repositories for word_embedding
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset☆304Updated 4 years ago
- ☆97Updated 5 years ago
- PyTorch Implementation of NBA game summary generator.☆83Updated 2 years ago
- Fine tuning bert for text generation☆38Updated 5 years ago
- Taipei QA question-answering bot (using BERT and ALBERT)☆41Updated 4 years ago
- (WIP) My humble contribution to the democratization of the Chinese NLP technology☆46Updated 5 years ago
- Keyphrase Extraction based on Scientific Text, SemEval 2017, Task 10☆108Updated 2 years ago
- COS960: A Chinese Word Similarity Dataset of 960 Word Pairs☆36Updated 5 years ago
- 🍳 NLPrep - dataset tool for many natural language processing tasks☆28Updated 3 years ago
- ☆91Updated 6 months ago
- Publicly available emotion training data☆58Updated last year
- ⚙️Tool for NLP - handle file and text☆15Updated 4 months ago
- [AAAI 2019] A Unified Model for Opinion Target Extraction and Target Sentiment Prediction☆272Updated last year
- TensorFlow implementation of the paper "Hierarchical Attention Networks for Document Classification"☆86Updated 5 years ago
- Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816☆39Updated 3 years ago
- A TensorFlow implementation for "Interactive Attention Networks for Aspect-Level Sentiment Classification"☆99Updated 4 years ago
- Chinese question-answer corpus from the PTT Gossiping board☆238Updated 3 weeks ago
- Deep Keyphrase Extraction using BERT☆256Updated 2 years ago
- Berserker - BERt chineSE woRd toKenizER☆17Updated 5 years ago
- A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.☆255Updated 5 years ago
- [AAAI 2019] Weakly-Supervised Hierarchical Text Classification☆86Updated 2 years ago
- seq2seq-based keyphrase generation models, including copyrnn, copycnn, and copytransformer☆51Updated 2 years ago
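- How to convert Chinese Wikipedia data from Simplified to Traditional Chinese, extract the text content, and organize it into JSON files☆18Updated 3 years ago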
- Training Chinese word vectors with Word2vec. Word2vec was created by a team of researchers led by Tomas Mikolov at Google.☆58Updated last year
- CNN-based model to realize aspect extraction of restaurant reviews based on pre-trained word embeddings and part-of-speech tagging☆104Updated 5 years ago
- A Python implementation of probabilistic latent semantic analysis (PLSA) using the EM algorithm☆65Updated 5 years ago
- ☆78Updated 5 years ago
- Named Entity Recognition with Pretrained XLM-RoBERTa☆87Updated 3 years ago
- Chinese version code for the paper "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks"☆11Updated 5 years ago
- A web crawler specifically for PTT website.☆19Updated 6 years ago