本地语料很多?爬到的文档很多?运行出无序结果很多?我们经常面对一些搜索引擎无法检索的文本/或其它程序运行结果,想要对这些内容进行检索、按相关性排序等。MySearch是用python3写的,用于方便中英文检索的小脚本,中文分词基于jieba、pkuseg,相关性排序基于sklearn的tf-idf
☆20Feb 2, 2019Updated 7 years ago
Alternatives and similar repositories for MySearch
Users that are interested in MySearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 简单搜索引擎,实现了拼写检查、倒排索引 、文档排序☆19May 7, 2019Updated 7 years ago
- Scripts to train a seq2seq model using tensorflow 2☆11Dec 9, 2019Updated 6 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- 问答系统☆13Apr 3, 2019Updated 7 years ago
- 新词发现,信息熵,左右互信息☆16Nov 3, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Generating NEW Reuters articles from Reuters articles.☆16Jan 10, 2017Updated 9 years ago
- A python wrapper for kaldi-online-decoder using Cython☆12Sep 1, 2017Updated 8 years ago
- 一个基于elasticsearch开发的搜索引擎网站☆14Nov 22, 2022Updated 3 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- 一个基于原生微信小程序的英语学习平台(毕设)☆13May 17, 2023Updated 2 years ago
- Turn Chinese natural language into structured data 中文自然语言理解,并支持spacy☆13Jul 9, 2024Updated last year
- Simple scripts to generate and use an Annoy index and lmdb map☆28Jan 4, 2018Updated 8 years ago
- 文章标签抽取☆16Dec 17, 2018Updated 7 years ago
- Bi-directional streaming speech-to-text service using Cloud ASRs☆15Aug 23, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")☆19Jun 1, 2017Updated 8 years ago
- 中国股市从1990年发展至今才仅仅28年,正处于一个茁壮成长的黄金时期。 股票市场发展至今形成了两大流派,技术派和基本派,基本分析以公司的价值为投资对象,发掘公司未来的投资价值;而技术分析则以行为为主,通过股票的历史走势,各种形态指标为研究对象;在股票市场的推进中,信息越发…☆10Apr 13, 2018Updated 8 years ago
- 法律数据挖掘☆22Jan 29, 2021Updated 5 years ago
- 文本标注工具,给文本打标签☆21Jan 9, 2020Updated 6 years ago
- Python tools for performing similarity searches on text documents.☆24Dec 9, 2016Updated 9 years ago
- 用于分库分表,表结构完全相同情况下从Mysql数据到导入数据到Elasticsearch搜索引擎。☆21Apr 7, 2016Updated 10 years ago
- 社会信息检索作业,实现简单的搜索引擎,计算TFIDF值以及两个句子的相似度☆19Apr 4, 2018Updated 8 years ago
- Simple arithmetic expression parser☆35Mar 24, 2022Updated 4 years ago
- 🗣 Multilingual RDF Verbalizer – Google Summer of Code 2019☆21Mar 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 北邮暑期课程信息检索与信息抽取课程设计☆17Oct 23, 2019Updated 6 years ago
- rasa_contrib is a addon package for rasa. It provide some useful/powerful addition components☆21Dec 8, 2022Updated 3 years ago
- 近年来,随着微信、微博、市长信箱、阳光热线等网络问政平台逐步成为政府了解民意、汇聚民智、凝聚民气的重要渠道,各类社情民意相关的文本数据量不断攀升,给以往主要依靠人工来进行留言划分和热点整理的相关部门的工作带来了极大挑战。同时,随着大数据技术的发展,建立基于自然语言处理技术的…☆36Jun 28, 2020Updated 5 years ago
- a basic bot to evaluate using neo4j graph database as chatbot memory.☆17Apr 3, 2018Updated 8 years ago
- Implementation of Dual Co-Matching Network for Multi-choice Reading Comprehension☆17Nov 15, 2019Updated 6 years ago
- A chatbot built based on seq2seq model with the extended ability to incorporate emotions☆17Jun 5, 2019Updated 6 years ago
- keras+tensorflow+python3下的中文分词, 大数据可训练,解决内存不够用问题☆40May 13, 2018Updated 7 years ago
- Banking chatbot based on Rasa open source machine learning tools for developers to create contextual AI assistants and chatbots that go b…☆18Aug 4, 2019Updated 6 years ago
- Keyword Spotting suitable for embedded devices.☆28Jun 22, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 24-hour Automatic Speech Recognition☆27Jun 4, 2021Updated 4 years ago
- 同义词扩展☆27Feb 16, 2016Updated 10 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28May 25, 2023Updated 2 years ago
- A Study on Stock Price Prediction and Quantitative Strategy - Based on Deep Learning 『深層学習に基づく株価予測とクオンツ戦略に関する研究』基于深度学习的股票价格预测和量化策略研究☆37Mar 30, 2022Updated 4 years ago
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- Keras implementation of the Smart Reply[1] Google system paper.☆25Aug 9, 2016Updated 9 years ago
- Render The Art of the Command Line to standalone HTML via R Markdown☆10Jun 1, 2019Updated 6 years ago