tf-idf 模型封装类,包含计算所有文档的tf-idf值,实现了基于tf-idf搜索引擎功能。根据query,计算与每个文档的相似度,返回与query相似度最高的topk文档
☆15Nov 20, 2020Updated 5 years ago
Alternatives and similar repositories for tf-idf
Users that are interested in tf-idf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use multi-threaded crawler to crawl the idiom data☆14Dec 11, 2020Updated 5 years ago
- python class for elasticsearch , including add, batch add, update, delete, query, and scan query. also with a demo that put Wikipedia in…☆17Sep 3, 2022Updated 3 years ago
- A based-bert baseline for Chinese idiom cloze test with pytorch.☆18Dec 24, 2020Updated 5 years ago
- semantic similarity, word2vec + wmd, bert+wmd, pytorch☆31Jan 29, 2024Updated 2 years ago
- Datafountain-Epidemic government affairs quiz assistant competition. We divided this task into two parts: document retrieval and answer e…☆14Aug 21, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆12Sep 2, 2021Updated 4 years ago
- Sequence Tagging for Biomedical Extractive Question Answering (Bioinformatics'2020)☆10Jul 3, 2023Updated 2 years ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆13Dec 17, 2023Updated 2 years ago
- Data and code for the paper Causal Reasoning of Entities and Events in Procedural Texts.☆11May 26, 2023Updated 3 years ago
- DataFountain 疫情政务问答助手解决方案分享☆16May 2, 2020Updated 6 years ago
- 文档记录☆15Mar 16, 2021Updated 5 years ago
- ☆11Feb 21, 2024Updated 2 years ago
- Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022☆13Jul 15, 2022Updated 3 years ago
- 2021 语言与智能技术竞赛关系 篇章级关系抽取☆17Sep 8, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 从零预训练LLM、SFT、RLHF、DPO笔记整理+面试问题☆21Sep 2, 2024Updated last year
- ☆16Jan 31, 2023Updated 3 years ago
- This repository is the implementation of "Top-down RST Parsing Utilizing Granularity Levels in Documents" published at AAAI 2020.☆19Dec 14, 2020Updated 5 years ago
- ☆15Mar 6, 2020Updated 6 years ago
- Implementation of AAAI2021 paper "Writing Polishment with Simile: Task, Dataset and A Neural Approach"☆21Dec 25, 2020Updated 5 years ago
- The objective of this project is to classify whether upcoming product will have positive or negative Sentiment.☆11May 18, 2019Updated 7 years ago
- ☆21Oct 15, 2022Updated 3 years ago
- A chinese simile recognition dataset of "Xiang".☆24Oct 5, 2022Updated 3 years ago
- A task relevant entity linking toolkit☆22Apr 2, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆26Dec 12, 2024Updated last year
- scrap wiki how for info☆26Jun 24, 2020Updated 5 years ago
- MultiSpanQA: A Dataset for Multi-Span Question Answering☆27Jan 24, 2026Updated 4 months ago
- A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction☆38Jun 6, 2022Updated 4 years ago
- LIQUID: A Framework for List Question Anwering Dataset Generation (AAAI 2023)☆27Jun 7, 2023Updated 3 years ago
- Resource of School of Software Engineering, South China University of Technology.☆21Feb 13, 2022Updated 4 years ago
- This is the pytorch implementation of the long paper on ACL 2020: A Self-Training Method for Machine Reading Comprehension with Soft Evid…☆33Aug 14, 2020Updated 5 years ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆37May 26, 2025Updated last year
- BBPE 底层实现☆38Apr 29, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ChineseBert用于中文拼写纠错☆43Mar 14, 2023Updated 3 years ago
- Reference Implementation for WSDM 2018 Paper "Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering"☆68Nov 16, 2018Updated 7 years ago
- [AAAI 2024] LLMEval Phase II dataset — professional domain evaluation across 12 academic disciplines☆71May 21, 2026Updated 3 weeks ago
- PyTorch implementation of paper "Mining Entity Synonyms with Efficient Neural Set Generation" in AAAI 2019☆66Nov 26, 2021Updated 4 years ago
- ☆127Jun 11, 2025Updated last year
- https://transformer-circuits.pub/2025/attribution-graphs/methods.html☆99Mar 27, 2025Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆90Mar 24, 2024Updated 2 years ago