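Too many local corpora? Too many crawled documents? Too many unordered program outputs? We often deal with text, or the output of other programs, that search engines cannot index, and we want to search it and rank it by relevance. MySearch is a small Python 3 script for convenient Chinese and English search. Chinese word segmentation is based on jieba and pkuseg, and relevance ranking is based on sklearn's tf-idf.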
☆20 · Feb 2, 2019 · Updated 7 years ago
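The description above (tokenize, weight terms with tf-idf, rank by relevance) can be illustrated with a minimal sketch. This is not MySearch's code: to stay dependency-free it swaps in a naive tokenizer (one token per CJK character) in place of jieba/pkuseg and a hand-written tf-idf in place of sklearn. The names `tokenize` and `rank` and the sample documents are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Naive stand-in for jieba/pkuseg: ASCII words stay whole,
    each CJK character becomes its own token."""
    return re.findall(r"[a-z0-9]+|[\u4e00-\u9fff]", text.lower())


def rank(query, docs):
    """Return (score, doc) pairs sorted by cosine similarity of tf-idf vectors."""
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    # Document frequency: in how many documents each token appears.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    # Smoothed idf: ln((1 + n) / (1 + df)) + 1
    idf = {tok: math.log((1 + n) / (1 + c)) + 1 for tok, c in df.items()}

    def vectorize(tokens):
        # Tokens never seen in the corpus are ignored.
        tf = Counter(t for t in tokens if t in idf)
        vec = {t: f * idf[t] for t, f in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values()))
        return {t: w / norm for t, w in vec.items()} if norm else {}

    q = vectorize(tokenize(query))
    scored = []
    for doc, toks in zip(docs, tokenized):
        d = vectorize(toks)
        # Dot product of two unit-length vectors is their cosine similarity.
        scored.append((sum(w * d.get(t, 0.0) for t, w in q.items()), doc))
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


docs = [
    "jieba 是一个中文分词工具",
    "tf-idf ranks documents by term weight",
    "搜索引擎 使用 倒排索引",
]
results = rank("中文分词", docs)
for score, doc in results:
    print(f"{score:.3f}  {doc}")
```

A real word segmenter would give better tokens than single characters for Chinese; the ranking step stays the same whichever tokenizer feeds it.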
Alternatives and similar repositories for MySearch
Users that are interested in MySearch are comparing it to the libraries listed below.
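- A simple search engine implementing spell checking, an inverted index, and document ranking ☆19 · May 7, 2019 · Updated 7 years ago
- Scripts to train a seq2seq model using tensorflow 2 ☆11 · Dec 9, 2019 · Updated 6 years ago
- Example program for quickly building a search engine ☆10 · Aug 10, 2016 · Updated 9 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- Related to pseudo-original content (article rewriting) ☆14 · Sep 4, 2019 · Updated 6 years ago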
- A search engine website built on elasticsearch ☆14 · Nov 22, 2022 · Updated 3 years ago
- ConceptNet to neo4j 2.2 ☆10 · Nov 6, 2015 · Updated 10 years ago
- Named entity recognition system using multi-stage CRF and statistical rules ☆12 · Oct 3, 2016 · Updated 9 years ago
- Experiment with JNI access to some Kaldi functions. ☆12 · Dec 31, 2018 · Updated 7 years ago
- Custom decoders for Kaldi ☆13 · Jun 5, 2019 · Updated 6 years ago
- Q&A summarization / seq2seq / PGN / Bert_sum / UniLM ☆19 · Oct 4, 2020 · Updated 5 years ago
- QA Server Based Chinese CQA Site ☆12 · Jul 14, 2021 · Updated 4 years ago
- Turn Chinese natural language into structured data; Chinese natural language understanding, with spacy support ☆13 · Jul 9, 2024 · Updated last year
- Spoken Language Identification from Short Utterances ☆13 · Jul 6, 2022 · Updated 3 years ago
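- Simple scripts to generate and use an Annoy index and lmdb map ☆28 · Jan 4, 2018 · Updated 8 years ago
- Feature extraction for accented-speech or pathological speech ☆18 · Apr 2, 2019 · Updated 7 years ago
- Medical named entity recognition, CRF ☆13 · Jun 26, 2019 · Updated 6 years ago
- Article tag extraction ☆16 · Dec 17, 2018 · Updated 7 years ago
- Bi-directional streaming speech-to-text service using Cloud ASRs ☆15 · Aug 23, 2017 · Updated 8 years ago
- Fine-tune the chain model based on the cvte open source model without training any GMM for frame alignment ☆13 · Aug 6, 2020 · Updated 5 years ago
- China's stock market has been developing for only 28 years since 1990 and is in a golden period of vigorous growth. Two major schools have emerged in the stock market, technical and fundamental: fundamental analysis takes a company's value as the investment target and seeks out its future investment value, while technical analysis focuses on behavior, studying a stock's historical price movements and various pattern indicators. As the stock market advances, information is increasingly… ☆10 · Apr 13, 2018 · Updated 8 years ago
- Legal data mining ☆22 · Jan 29, 2021 · Updated 5 years ago
- Text annotation tool for labeling text ☆21 · Jan 9, 2020 · Updated 6 years ago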
- Python version of the Aho-Corasick automaton. ☆19 · Jul 5, 2021 · Updated 4 years ago
- New word discovery algorithm and synonym mining ☆27 · Oct 24, 2017 · Updated 8 years ago
- Large-scale text retrieval system based on ElasticSearch ☆20 · Jun 3, 2018 · Updated 7 years ago
- Chinese time entity recognition (time extraction) tool based on regular expressions ☆25 · Nov 9, 2018 · Updated 7 years ago
- Simple arithmetic expression parser ☆35 · Mar 24, 2022 · Updated 4 years ago
- GUI tool for SEO pseudo-original (rewritten) articles ☆20 · Jun 13, 2018 · Updated 7 years ago
- 🗣 Multilingual RDF Verbalizer – Google Summer of Code 2019 ☆21 · Mar 24, 2023 · Updated 3 years ago
- Course project for the BUPT summer course on information retrieval and information extraction ☆17 · Oct 23, 2019 · Updated 6 years ago
- Listen, Attend and Spell - PyTorch Implementation ☆17 · Dec 28, 2018 · Updated 7 years ago
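- In recent years, as online platforms for public input such as WeChat, Weibo, the mayor's mailbox, and the Sunshine Hotline have become important channels for the government to understand public opinion, gather public wisdom, and build public morale, the volume of text data on social conditions and public opinion has kept rising, posing a major challenge to departments that used to rely mainly on manual work to categorize messages and compile hot topics. Meanwhile, with the development of big data technology, building a natural-language-processing-based… ☆36 · Jun 28, 2020 · Updated 5 years ago
- A basic bot to evaluate using the neo4j graph database as chatbot memory. ☆17 · Apr 3, 2018 · Updated 8 years ago
- A simple implementation of "Personalizing Dialogue Agents: I have a dog, do you have pets too?" ☆14 · Nov 27, 2018 · Updated 7 years ago
- A chatbot built based on seq2seq model with the extended ability to incorporate emotions ☆17 · Jun 5, 2019 · Updated 6 years ago
- Chinese word segmentation with keras + tensorflow + python3; trainable on large datasets, addressing out-of-memory problems ☆40 · May 13, 2018 · Updated 8 years ago
- Banking chatbot based on Rasa open source machine learning tools for developers to create contextual AI assistants and chatbots that go b… ☆18 · Aug 4, 2019 · Updated 6 years ago
- Keyword Spotting suitable for embedded devices. ☆28 · Jun 22, 2020 · Updated 5 years ago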