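A wrapper class for a TF-IDF model. It computes TF-IDF values for all documents and implements a TF-IDF-based search engine: given a query, it computes the similarity between the query and each document and returns the top-k documents most similar to the query.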
☆16Nov 20, 2020Updated 5 years ago
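The flow in the description (weight every document with TF-IDF, score the query against each document, return the top-k) can be sketched roughly as below. This is a minimal illustration, not the repository's actual code: the class name `TfidfSearch`, its methods, the smoothed IDF formula, and the use of cosine similarity are all assumptions, and tokenization is assumed to happen upstream (for Chinese text, a segmenter would be needed first).

```python
import math
from collections import Counter


class TfidfSearch:
    """Hypothetical minimal TF-IDF index; not the repository's actual class."""

    def __init__(self, docs):
        # docs: list of token lists (tokenization is assumed to be done upstream)
        self.docs = docs
        n = len(docs)
        # Document frequency: number of documents that contain each term
        df = Counter(term for doc in docs for term in set(doc))
        # Smoothed IDF (an assumed choice) so every known term keeps a positive weight
        self.idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
        self.vectors = [self._vectorize(doc) for doc in docs]

    def _vectorize(self, tokens):
        # Sparse TF-IDF vector as a dict; terms unseen at index time are ignored
        counts = Counter(t for t in tokens if t in self.idf)
        total = sum(counts.values())
        if total == 0:
            return {}
        return {t: (c / total) * self.idf[t] for t, c in counts.items()}

    @staticmethod
    def _cosine(a, b):
        # Cosine similarity between two sparse vectors; 0.0 if either is empty
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        na = math.sqrt(sum(w * w for w in a.values()))
        nb = math.sqrt(sum(w * w for w in b.values()))
        return dot / (na * nb) if na and nb else 0.0

    def search(self, query_tokens, topk=3):
        # Score the query against every document and keep the topk best
        q = self._vectorize(query_tokens)
        scores = [(i, self._cosine(q, v)) for i, v in enumerate(self.vectors)]
        scores.sort(key=lambda pair: pair[1], reverse=True)
        return scores[:topk]


docs = [
    "the cat sat on the mat".split(),
    "dogs chase cats in the park".split(),
    "tf idf ranks documents by term weight".split(),
]
index = TfidfSearch(docs)
results = index.search("cat on mat".split(), topk=2)  # list of (doc_index, score)
```

Each result is a `(document index, similarity)` pair sorted by descending similarity. Storing vectors as dicts keeps the sketch dependency-free; a real implementation over a large corpus would more likely use sparse matrices.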
Alternatives and similar repositories for tf-idf
Users who are interested in tf-idf are comparing it to the libraries listed below.
- Use multi-threaded crawler to crawl the idiom data☆14Dec 11, 2020Updated 5 years ago
- Python class for Elasticsearch, including add, batch add, update, delete, query, and scan query. Also with a demo that put Wikipedia in…☆17Sep 3, 2022Updated 3 years ago
- A BERT-based baseline for Chinese idiom cloze test with PyTorch.☆18Dec 24, 2020Updated 5 years ago
- Semantic similarity: word2vec + WMD, BERT + WMD, PyTorch☆31Jan 29, 2024Updated 2 years ago
- Datafountain-Epidemic government affairs quiz assistant competition. We divided this task into two parts: document retrieval and answer e…☆14Aug 21, 2022Updated 3 years ago
- NCNN Android SinglePoseEstimation Demo☆15Apr 26, 2020Updated 6 years ago
- Realtime Pose Estimation NCNN ONNX☆24Apr 29, 2020Updated 6 years ago
- Topic analysis based on jieba word segmentation and an LDA model☆19Apr 20, 2019Updated 7 years ago
- ☆13Sep 2, 2021Updated 4 years ago
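- Text similarity computation with TF-IDF + Word2vec, ideally for long texts☆24Dec 18, 2019Updated 6 years ago
- Based on clustering algorithms, the LDA topic model, and classifiers, mines location-based topic events from a Twitter corpus and performs fine-grained sentiment analysis on those topic events☆36Jul 29, 2018Updated 7 years ago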
- Sequence Tagging for Biomedical Extractive Question Answering (Bioinformatics'2020)☆11Jul 3, 2023Updated 2 years ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆14Dec 17, 2023Updated 2 years ago
- Implementation of our paper "Towards Consistent Document-Level Entity Linking: Joint Models for Entity Linking and Coreference Resolution…☆12Nov 13, 2022Updated 3 years ago
- Data and code for the paper Causal Reasoning of Entities and Events in Procedural Texts.☆12May 26, 2023Updated 3 years ago
- Solution sharing for the DataFountain epidemic government affairs Q&A assistant competition☆16May 2, 2020Updated 6 years ago
- Documentation notes☆15Mar 16, 2021Updated 5 years ago
- Natural Language Processing (NLP) and Large Language Models (LLM) with Fine-Tuning LLM and make Chatbot Question answering (QA) with LoRA…☆13Jan 20, 2024Updated 2 years ago
- Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022☆14Jul 15, 2022Updated 3 years ago
- [Course] Simple database in C++ (Database 2017)☆22Apr 1, 2019Updated 7 years ago
- Organized notes on pretraining an LLM from scratch, SFT, RLHF, and DPO, plus interview questions☆22Sep 2, 2024Updated last year
- ☆17Jan 31, 2023Updated 3 years ago
- This repository is the implementation of "Top-down RST Parsing Utilizing Granularity Levels in Documents" published at AAAI 2020.☆20Dec 14, 2020Updated 5 years ago
- ☆16Mar 6, 2020Updated 6 years ago
- Implementation of AAAI2021 paper "Writing Polishment with Simile: Task, Dataset and A Neural Approach"☆21Dec 25, 2020Updated 5 years ago
- A Chinese simile recognition dataset of "Xiang".☆24Oct 5, 2022Updated 3 years ago
- A task relevant entity linking toolkit☆23Apr 2, 2022Updated 4 years ago
- ☆27Dec 12, 2024Updated last year
- MultiSpanQA: A Dataset for Multi-Span Question Answering☆28Jan 24, 2026Updated 4 months ago
- Moss Vortex is a lightweight and high-performance deployment and inference backend engineered specifically for MOSS 003, providing a weal…☆37Apr 25, 2023Updated 3 years ago
- A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction☆37Jun 6, 2022Updated 3 years ago
- LIQUID: A Framework for List Question Answering Dataset Generation (AAAI 2023)☆28Jun 7, 2023Updated 2 years ago
- Resource of School of Software Engineering, South China University of Technology.☆21Feb 13, 2022Updated 4 years ago
- This is the PyTorch implementation of the long paper on ACL 2020: A Self-Training Method for Machine Reading Comprehension with Soft Evid…☆34Aug 14, 2020Updated 5 years ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue