本地语料很多?爬到的文档很多?运行出无序结果很多?我们经常面对一些搜索引擎无法检索的文本/或其它程序运行结果,想要对这些内容进行检索、按相关性排序等。MySearch是用python3写的,用于方便中英文检索的小脚本,中文分词基于jieba、pkuseg,相关性排序基于sklearn的tf-idf
☆20Feb 2, 2019Updated 7 years ago
Alternatives and similar repositories for MySearch
Users that are interested in MySearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 简单搜索引擎,实现了拼写检查、倒排索引 、文档排序☆19May 7, 2019Updated 7 years ago
- 以前的伪原创类,放这做个纪念,仅此。☆14Aug 8, 2017Updated 8 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- 快速搭建一个搜索引擎,示例程序☆10Aug 10, 2016Updated 9 years ago
- 问答系统☆13Apr 3, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 新词发现,信息熵,左右互信息☆16Nov 3, 2018Updated 7 years ago
- 伪原创相关☆14Sep 4, 2019Updated 6 years ago
- A python wrapper for kaldi-online-decoder using Cython☆12Sep 1, 2017Updated 8 years ago
- 一个基于elasticsearch开发的搜索引擎网站☆14Nov 22, 2022Updated 3 years ago
- ConceptNet to neo4j 2.2☆10Nov 6, 2015Updated 10 years ago
- Named entity recognition system using multi-stage CRF and statistical rules☆11Oct 3, 2016Updated 9 years ago
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 7 years ago
- 问答摘要/seq2seq/PGN/Bert_sum/UniLM☆19Oct 4, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- QA Server Based Chinese CQA Site☆12Jul 14, 2021Updated 4 years ago
- Turn Chinese natural language into structured data 中文自然语言理解,并支持spacy☆13Jul 9, 2024Updated last year
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- Simple scripts to generate and use an Annoy index and lmdb map☆28Jan 4, 2018Updated 8 years ago
- 医疗命名实体识别, CRF,☆13Jun 26, 2019Updated 6 years ago
- 文章标签抽取☆16Dec 17, 2018Updated 7 years ago
- finetune the chain model based on cvte open source model without traing any GMM for frame alignment☆12Aug 6, 2020Updated 5 years ago
- 基于 Redis 的全文检索引擎和自然语言处理工具☆15Jul 30, 2014Updated 11 years ago
- 中国股市从1990年发展至今才仅仅28年,正处于一个茁壮成长的黄金时期。 股票市场发展至今形成了两大流派,技术派和基本派,基本分析以公司的价值为投资对象,发掘公司未来的投资价值;而技术分析则以行为为主,通过股票的历史走势,各种形态指标为研究对象;在股票市场的推进中,信息越发…☆10Apr 13, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 法律数据挖掘☆22Jan 29, 2021Updated 5 years ago
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning☆17Jan 4, 2023Updated 3 years ago
- 文本标注工具,给文本打标签☆21Jan 9, 2020Updated 6 years ago
- 用于分库分表,表结构完全相同情况下从Mysql数据到导入数据到Elasticsearch搜索引擎。☆21Apr 7, 2016Updated 10 years ago
- Python version Aho-Corasic Automaton.☆19Jul 5, 2021Updated 4 years ago
- ☆12Feb 14, 2019Updated 7 years ago
- 新词发现算法与同义词挖掘☆27Oct 24, 2017Updated 8 years ago
- 基于ElasticSearch的海量文本检索系统☆20Jun 3, 2018Updated 8 years ago
- Time entity recognition tool based on regular expression 基于正则表达式的中文时间实体识别(时间提取)工具☆25Nov 9, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simple arithmetic expression parser☆34Mar 24, 2022Updated 4 years ago
- seo伪原创工具GUI,SEO文章伪原创工具GUI☆20Jun 13, 2018Updated 8 years ago
- 🗣 Multilingual RDF Verbalizer – Google Summer of Code 2019☆21Mar 24, 2023Updated 3 years ago
- 北邮暑期课程信息检索与信息抽取课程设计☆17Oct 23, 2019Updated 6 years ago
- rasa_contrib is a addon package for rasa. It provide some useful/powerful addition components☆21Dec 8, 2022Updated 3 years ago
- 近年来,随着微信、微博、市长信箱、阳光热线等网络问政平台逐步成为政府了解民意、汇聚民智、凝聚民气的重要渠道,各类社情民意相关的文本数据量不断攀升,给以往主要依靠人工来进行留言划分和热点整理的相关部门的工作带来了极大挑战。同时,随着大数据技术的发展,建立基于自然语言处理技术的…☆36Jun 28, 2020Updated 5 years ago
- a basic bot to evaluate using neo4j graph database as chatbot memory.☆17Apr 3, 2018Updated 8 years ago