A fast Python implementation of the BM25 algorithm, built on an inverted index
☆49Aug 27, 2022Updated 3 years ago
Alternatives and similar repositories for fastbm25
Users that are interested in fastbm25 are comparing it to the libraries listed below.
- Large-scale translation using Google Translate, immune to blocking☆10Aug 1, 2019Updated 6 years ago
- Lightweight deep learning inference service framework☆39Jun 19, 2021Updated 4 years ago
- ☆10Nov 29, 2024Updated last year
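- Song recommendation based on semantic and behavioral information. Includes song information crawling, data processing, word2vec song vector representation, data storage, song recommendation, and web visualization. (Python, Java)☆29Jun 17, 2022Updated 3 years ago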
- ☆21Mar 19, 2021Updated 5 years ago
- ☆15Oct 26, 2021Updated 4 years ago
- [NAACL(2019)] Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models☆11Apr 27, 2022Updated 4 years ago
- An opinionated NLP research template☆10Aug 29, 2024Updated last year
- Companion source code for a collection of common mistakes in Python business development☆10Dec 19, 2020Updated 5 years ago
- Python coherence evaluation tool using Stanford's CoreNLP.☆10Feb 2, 2020Updated 6 years ago
- Introductory article on dialogue rewriting☆98Jun 12, 2023Updated 2 years ago
- ☆13Jan 14, 2021Updated 5 years ago
- ☆17Feb 13, 2022Updated 4 years ago
- ☆14Aug 26, 2016Updated 9 years ago
- ☆14Jun 13, 2022Updated 3 years ago
- AI agent skills for tech startup founders — fundraising, sales, product, recruiting, engineering, legal, ops, and growth. Works with Clau…☆134Mar 16, 2026Updated 2 months ago
- ☆10Sep 17, 2016Updated 9 years ago
- Cascade bert+word vec and one layer FLAT, trained by adversarial FGM and Stochastic Weight Averaging☆23Nov 4, 2021Updated 4 years ago
- Manages vllm-nccl dependency☆18Jun 3, 2024Updated last year
- *high-load* benchmarking tool☆18May 13, 2026Updated 2 weeks ago
- ☆16Jun 14, 2024Updated last year
- TensorFlow: learn and practice☆11Aug 30, 2018Updated 7 years ago
- Docker template for basic data science packages to interface with Neo4j☆14Nov 8, 2021Updated 4 years ago
- Scraped reviews from OpenRice for sentiment analysis. Formatted to use with BERT.☆11Apr 9, 2020Updated 6 years ago
- word2vec with a context based on sentences.☆15Jan 30, 2017Updated 9 years ago
- a fast implementation of BM25☆10Sep 15, 2022Updated 3 years ago
- FHIR resources Release R5☆11Jul 3, 2023Updated 2 years ago
- ☆15Aug 23, 2023Updated 2 years ago
- ☆11Nov 21, 2024Updated last year
- Semantic Proximity Search on Heterogeneous Graph by Proximity Embedding☆15Feb 20, 2018Updated 8 years ago
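- Real-time text input by voice (speaking). Built as secondary development on the PaddleSpeech project; supports offline deployment without network access and GPU inference. The client currently supports Windows only.☆25Nov 25, 2022Updated 3 years ago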
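- Early computers used 7-bit ASCII encoding. To handle Chinese characters, programmers designed GB2312 for Simplified Chinese and big5 for Traditional Chinese. GB2312 (1980) contains 7,445 characters in total, including 6,763 Chinese characters and 682 other symbols. In the Chinese character area, the internal code ranges from B0-F7 for the high byte and A1-FE for the low byte, occupying code…☆10Sep 10, 2017Updated 8 years ago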
- A curated list of resources dedicated to word segmentation☆12Jan 9, 2019Updated 7 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆19Apr 8, 2026Updated last month
- ☆18Mar 7, 2022Updated 4 years ago
- This is the implementation for the paper: Sequential Recommender System based on Hierarchical Attention Network☆11Mar 13, 2021Updated 5 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆24Mar 17, 2021Updated 5 years ago
- Adaptive Scaling for Sparse Detection in Information Extraction☆31Jun 12, 2018Updated 7 years ago
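- This project consists of three modules. Intent recognition: determines whether the user's intent is business-related or chit-chat. Model retrieval: builds a corpus and, when the user issues a new query (classified as a business dialogue by intent recognition), matches the best response for that query, using HSWN for recall (coarse ranking), then computes sentence similarity and uses Lig…☆12Feb 18, 2021Updated 5 years ago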