本地语料很多?爬到的文档很多?运行出无序结果很多?我们经常面对一些搜索引擎无法检索的文本/或其它程序运行结果,想要对这些内容进行检索、按相关性排序等。MySearch是用python3写的,用于方便中英文检索的小脚本,中文分词基于jieba、pkuseg,相关性排序基于sklearn的tf-idf
☆20Feb 2, 2019Updated 7 years ago
Alternatives and similar repositories for MySearch
Users that are interested in MySearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 简单搜索引擎,实现了拼写检查、倒排索引 、文档排序☆18May 7, 2019Updated 6 years ago
- Scripts to train a seq2seq model using tensorflow 2☆11Dec 9, 2019Updated 6 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- 新词发现,信息熵,左右互信息☆16Nov 3, 2018Updated 7 years ago
- Generating NEW Reuters articles from Reuters articles.☆16Jan 10, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 伪原创相关☆14Sep 4, 2019Updated 6 years ago
- 一个基于elasticsearch开发的搜索引擎网站☆14Nov 22, 2022Updated 3 years ago
- PyTorch implementation for NAACL 2022 paper: "Document-Level Relation Extraction with Sentences Importance Estimation and Focusing"☆17Apr 29, 2022Updated 3 years ago
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- 问答摘要/seq2seq/PGN/Bert_sum/UniLM☆19Oct 4, 2020Updated 5 years ago
- QA Server Based Chinese CQA Site☆12Jul 14, 2021Updated 4 years ago
- 一个基于原生微信小程序的英语学习平台(毕设)☆13May 17, 2023Updated 2 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Simple scripts to generate and use an Annoy index and lmdb map☆28Jan 4, 2018Updated 8 years ago
- Feature extraction for accented-speech or pathological speech☆18Apr 2, 2019Updated 7 years ago
- Amazon SSML cheatsheet☆16Nov 9, 2018Updated 7 years ago
- 医疗命名实体识别, CRF,☆13Jun 26, 2019Updated 6 years ago
- 文章标签抽取☆16Dec 17, 2018Updated 7 years ago
- Bi-directional streaming speech-to-text service using Cloud ASRs☆15Aug 23, 2017Updated 8 years ago
- finetune the chain model based on cvte open source model without traing any GMM for frame alignment☆13Aug 6, 2020Updated 5 years ago
- Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")☆19Jun 1, 2017Updated 8 years ago
- 中国股市从1990年发展至今才仅仅28年,正处于一个茁壮成长的黄金时期。 股票市场发展至今形成了两大流派,技术派和基本派,基本分析以公司的价值为投资对象,发掘公司未来的投资价值;而技术分析则以行为为主,通过股票的历史走势,各种形态指标为研究对象;在股票市场的推进中,信息越发…☆10Apr 13, 2018Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 法律数据挖掘☆22Jan 29, 2021Updated 5 years ago
- 文本标注工具,给文本打标签☆21Jan 9, 2020Updated 6 years ago
- 用于分库分表,表结构完全相同情况下从Mysql数据到导入数据到Elasticsearch搜索引擎。☆21Apr 7, 2016Updated 10 years ago
- Python version Aho-Corasic Automaton.☆19Jul 5, 2021Updated 4 years ago
- ☆12Feb 14, 2019Updated 7 years ago
- 新词发现算法与同义词挖掘☆27Oct 24, 2017Updated 8 years ago
- [WWW 2022] Zero-Shot Stance Detection via Contrastive Learning☆26Dec 6, 2022Updated 3 years ago
- 北邮暑期课程信息检索与信息抽取课程设计☆16Oct 23, 2019Updated 6 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Dec 28, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 近年来,随着微信、微博、市长信箱、阳光热线等网络问政平台逐步成为政府了解民意、汇聚民智、凝聚民气的重要渠道,各类社情民意相关的文本数据量不断攀升,给以往主要依靠人工来进行留言划分和热点整理的相关部门的工作带来了极大挑战。同时,随着大数据技术的发展,建立基于自然语言处理技术的…☆36Jun 28, 2020Updated 5 years ago
- a basic bot to evaluate using neo4j graph database as chatbot memory.☆17Apr 3, 2018Updated 8 years ago
- 基于Python的Sqlite3助手类库,采用连贯操作实 现数据库的CURD功能。☆27Aug 6, 2020Updated 5 years ago
- simply implement "Personalizing Dialogue Agents: I have a dog, do you have pets too? "☆14Nov 27, 2018Updated 7 years ago
- Implementation of Dual Co-Matching Network for Multi-choice Reading Comprehension☆17Nov 15, 2019Updated 6 years ago
- A chatbot built based on seq2seq model with the extended ability to incorporate emotions☆17Jun 5, 2019Updated 6 years ago
- 同义词扩展☆27Feb 16, 2016Updated 10 years ago