基于TF-IDF和余弦定理计算文本相似度
☆36Aug 29, 2018Updated 7 years ago
Alternatives and similar repositories for cosSim
Users that are interested in cosSim are comparing it to the libraries listed below
Sorting:
- NLP的一些小例子,如:文本分类、文本纠错、关键词提取、自动摘要等☆23Dec 12, 2018Updated 7 years ago
- 对四种句子/文本相似度计算方法进行实验与比较☆291Sep 1, 2020Updated 5 years ago
- Scholarly Big Data Subject Category Classifier☆10Jul 15, 2019Updated 6 years ago
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated last year
- Python3 实现的文章余弦相似度计算☆10Sep 28, 2017Updated 8 years ago
- A set of visualization engines.☆14Updated this week
- An email client in C# using WPF☆11May 14, 2015Updated 10 years ago
- 爬虫豆瓣读书评分9分以上榜单☆42Apr 12, 2020Updated 5 years ago
- D3-based interactive bubble chart for topic model visualization☆13May 10, 2022Updated 3 years ago
- Build Modern Chatbot using Rasa☆10Aug 21, 2022Updated 3 years ago
- 使用Scrapy爬取主流网站的项目集合,持续更新。☆10Nov 13, 2024Updated last year
- ☆13May 25, 2023Updated 2 years ago
- Blog,Algorithm,C,C++,Linux☆10Jun 4, 2016Updated 9 years ago
- Colab notebooks for d2l-book☆11Dec 5, 2019Updated 6 years ago
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…☆11Jul 10, 2020Updated 5 years ago
- Ask questions about government data.☆38Jan 17, 2019Updated 7 years ago
- Explore importing the Semantic Scholar Academic Graph Corpus into a PostgreSQL database☆13Aug 30, 2024Updated last year
- Code and data associated with our LREC 2018 and COLING 2018 papers on converting between emotion formats☆10Dec 15, 2022Updated 3 years ago
- Mirror of pdftk. For more information please see http://flowpaper.com☆11Sep 6, 2016Updated 9 years ago
- Demo for Apache Tika☆13Oct 12, 2015Updated 10 years ago
- 根据网易云歌单ID 爬取歌单内所有歌曲的歌词 并根据歌词中词语出现的频率生成词云图☆13Apr 4, 2018Updated 7 years ago
- Visualizes search engine ranking algorithms for a given domain☆30Dec 13, 2010Updated 15 years ago
- 国内外主流搜索引擎爬虫☆12Aug 5, 2018Updated 7 years ago
- Shadowsocksr client using electron☆11Jul 3, 2018Updated 7 years ago
- 模拟浏览器脚本操作,使用nodejs来批量读取和操作网盘文件信息。 这个代码库是`百度网盘批 量清理重复文件计划`的一部分。☆11Mar 16, 2023Updated 2 years ago
- ☆10Sep 16, 2021Updated 4 years ago
- Codes and Datasets for our SIGIR 2021 Paper: "Understanding the Role of Affect Dimensions in Detecting Emotions from Tweets: A Multi-task…☆12Apr 21, 2021Updated 4 years ago
- python script to extract jpg images from pdf☆13Sep 18, 2017Updated 8 years ago
- CCF BDCI 2022比赛 返乡发展人群预测赛题 Baseline 数据挖掘(特征工程+集成学习)队伍排名39/2297☆12Mar 15, 2024Updated last year
- A monolingual parallel corpus for sentence simplification☆11Jul 4, 2016Updated 9 years ago
- This repository contains some examples of using borb in google colab. These examples enable you to try out the features of borb without i…☆13Sep 4, 2022Updated 3 years ago
- Scout - commmandline tool for command-not-found operations☆13Feb 22, 2026Updated last week
- Transformer(attention-is-all-you-need)的pytorch实现,带run demo,可以跑通☆10Apr 16, 2019Updated 6 years ago
- ☆19Jun 4, 2025Updated 8 months ago
- Stack Overflow Command Line☆22Apr 2, 2012Updated 13 years ago
- 目前多流形学习算法matlab代码☆13Nov 23, 2018Updated 7 years ago
- API and CLI for getting the stars for one or more GitHub users or organizations.☆18Sep 13, 2017Updated 8 years ago
- Simple wrapper around Puppeteer to take screenshot from command line.☆16Feb 12, 2022Updated 4 years ago
- ☆13Feb 16, 2021Updated 5 years ago