基于word2vec使用wiki中文语料库实现词向量训练模型
☆59May 22, 2019Updated 6 years ago
Alternatives and similar repositories for wiki-word2vec
Users that are interested in wiki-word2vec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用gensim训练word2vec模型并对训练得到词向量聚类☆16Sep 23, 2017Updated 8 years ago
- 用百科数据和搜狗新闻数据训练word2vec模型☆19Apr 1, 2018Updated 8 years ago
- 中文的word2vec以及doc2vec模型,使用维基百度的数据训练。供大家参考☆47May 8, 2018Updated 8 years ago
- A text classification and similairty computing project in Python.We have tried wordbag,word2vec,WordMoverDistance,N-gram,LSTM,C-LSTM, LST…☆11May 18, 2019Updated 6 years ago
- 利用Python构建Wiki中文语料词向量模型试验☆524Dec 2, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 基于word2vec预训练词向量; textCNN 模型 ;charCNN 模型 ;Bi-LSTM模型;Bi-LSTM + Attention 模型 ;Transformer 模型 ;ELMo 预训练模型 ;BERT 预训练模型的文本分类项目☆123Jul 24, 2020Updated 5 years ago
- ☆29May 30, 2019Updated 6 years ago
- 使用Bi-LSTM和crf来进行人名识别,数据集人民日报98年1月标注数据集,训练:验证:测试为3:1:1☆22Jul 25, 2018Updated 7 years ago
- NER for Chinese electronic medical records. Use doc2vec, self_attention and multi_attention.☆28Jul 28, 2018Updated 7 years ago
- The pytorch implementation of Cluster-Aware Supervised Contrastive Learning on Graphs (WWW 2022).☆11Jun 6, 2022Updated 3 years ago
- 利用bert预训练模型生成句向量或词向量☆26Oct 29, 2020Updated 5 years ago
- Provide microservice API for HanLP☆11Jun 21, 2022Updated 3 years ago
- A Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime☆15Dec 7, 2024Updated last year
- 使用中文维基百科语料库训练一个word2vec模型(250维)并使用说明☆11Apr 24, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 基于语义的文本相似度计算☆10Jan 22, 2019Updated 7 years ago
- Nowcasting macroeconomic indicators using Google Trends☆10Jun 23, 2022Updated 3 years ago
- 文本关键词提取,对文本分词后使用多种方法提取给定语料中的关键词,包含结巴自带的 TF-IDF 算法、TextRank 算法、Scikit-Learn 包中的 TF-IDF☆11Jan 4, 2019Updated 7 years ago
- Using CNN to classify RF modulation data.☆14Jun 25, 2020Updated 5 years ago
- 基于Tensorflow2.0和Transformer实现机器翻译代码详解☆21Dec 4, 2019Updated 6 years ago
- ☆11Mar 30, 2021Updated 5 years ago
- 垃圾邮件检测 词袋模型+机器学习、word2vec+cnn☆19Jul 11, 2019Updated 6 years ago
- Fall 2023 NJUSE Machine Learning Course -- Group Project: DeepEMD for LibFewShot☆11May 16, 2024Updated last year
- 可部署的相似度模型 deployable similarity model☆17Oct 27, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 中文命名实体识别& 中文命名实体检测 python实现 基于字+ 词位 分别使用tensorflow IDCNN+CRF 及 BiLSTM+CRF 搭配词性标注实现中文命名实体识别及命名实体检测☆65Dec 13, 2018Updated 7 years ago
- 关键词抽取技术☆18Sep 11, 2019Updated 6 years ago
- AI100竞赛:http://competition.ai100.com.cn/html/game_det.html?id = 24&tab = 1 的代码,主要用于文本分类,其中涉及CHI选择特征词,TFIDF计算权重,朴素贝叶斯,决策树,SVM,XGBoost等算法☆15Mar 27, 2019Updated 7 years ago
- Predicting Unplanned Hospital Readmission Using Natural Language Processing of MIMICIII Discharge Notes☆12Feb 12, 2019Updated 7 years ago
- 2016年课程设计:人事管理系统(荆超等11人)☆10Jul 13, 2016Updated 9 years ago
- UCSD CSE 237D Spring '20 Course Project☆20Sep 4, 2023Updated 2 years ago
- 基于bert中文多分类模型☆11Mar 23, 2019Updated 7 years ago
- GBDT结合LR的二分类模型,封装成了一个类。scikit-learn风格,可以fit和predict。有run_demo☆11Sep 5, 2019Updated 6 years ago
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The Pytorch implementation demo for our TMM article: Self-consistent Contrastive Attributed Graph Clustering with Pseudo-label Prompt.☆15Dec 6, 2022Updated 3 years ago
- Bidirectional Recurrent Neural Network based sequence labeling for Medical Text.☆24May 22, 2016Updated 9 years ago
- 最大开源中文问答数据集 ,助力中文LLM.The largest open-source Chinese Q&A dataset, supporting Chinese LLM☆10Jul 31, 2023Updated 2 years ago
- 领域自适应文本挖掘工具(新词发现、情感分析、实体链接等),基于少量种子词和背景知识☆13Jun 19, 2019Updated 6 years ago
- bert 词向量 句向量生成☆12Sep 1, 2019Updated 6 years ago
- ☆13Aug 9, 2023Updated 2 years ago
- ☆18May 22, 2020Updated 5 years ago