lrcUnlimited / check_file_systemView external linksLinks
simhash算法实现海量内容查重
☆14Apr 23, 2016Updated 9 years ago
Alternatives and similar repositories for check_file_system
Users that are interested in check_file_system are comparing it to the libraries listed below
Sorting:
- 使用simhash算法,快速索引和查询大量文本简历☆21Dec 16, 2015Updated 10 years ago
- 使用Simhash对海量文本进行去重☆12Jun 2, 2018Updated 7 years ago
- 海量中文文本快速查重☆18Dec 16, 2018Updated 7 years ago
- some articles from gitchat VIP☆14Dec 19, 2021Updated 4 years ago
- ☆10May 28, 2024Updated last year
- 图像分类系统,采用HOG+SVM/Sotfmax分类器,神经网络采用卷积神经网络和34层的深度参查网络,利用基于tensorflow的tflearn实现。☆10May 23, 2017Updated 8 years ago
- This is for Meridian (Traditional Chinese Medicine conception) prediction by machining learning method.☆11Sep 30, 2019Updated 6 years ago
- Online Comment Toxicity Analysis using averaging the Classifiers and used both char level as well as word level n-grams.☆10Mar 31, 2018Updated 7 years ago
- 一个基于trie树的具有联想功能的文本编辑器。采用python和pyqt☆10Sep 7, 2016Updated 9 years ago
- Codes for the Nature Conservancy Fish classification challenge☆10Jan 2, 2017Updated 9 years ago
- This repository contains comprehensive pricing and configuration data for LLMs. It powers cost attribution for 200+ enterprises running 4…☆52Updated this week
- 收集完成的tensorflow实例,使用图片分类模式训练并使用图片识别,支持控制台模式和B/S模式。☆12Jul 31, 2017Updated 8 years ago
- A small database to test different machine learning tasks. It contains simple shapes of different colors.☆11Sep 18, 2022Updated 3 years ago
- h5涂色小游戏☆12Jan 1, 2023Updated 3 years ago
- 监控系统后台前端demo,使用vue、element-ui、echarts和mqtt☆13Jan 29, 2024Updated 2 years ago
- Chrome Extension demo for a tutorial☆12Mar 5, 2023Updated 2 years ago
- ☆12Aug 12, 2024Updated last year
- Price Spider is a Python tool to get price & promotion from JD, Tmall, Amazon, BeiBei☆10Jun 14, 2019Updated 6 years ago
- 文档去重功能是为了解决搜索引擎的文档语义重复的问题,方法是多重哈希下的语义指纹算法。☆12Aug 17, 2013Updated 12 years ago
- ZEGO GoClass 是一款基于 ZEGO 音视频互动服务、即构互动白板服务(ZegoWhiteboard)以及 ZEGO 云端录制服务, 根据在线教育行业通用场景及需求研发出来的一套可供教育机构直接使用并开展运营的场景方案。☆10Aug 4, 2022Updated 3 years ago
- SQL injection detection engine by tokenzing and syntax analysis, like SQLChop☆10May 8, 2017Updated 8 years ago
- ☆15Sep 23, 2020Updated 5 years ago
- 研究组的材料分享,例如:发明专利撰写方法,发表的论文,爬虫方法等☆13May 3, 2017Updated 8 years ago
- A python based SDK developed for interacting with GMX v2☆20Jan 17, 2025Updated last year
- 【字节Lark】- 基础架构中一些规范:git流程规范、IDL描述文件规范和管理、中间件选型使用规范、RPC通信框架设计和规范、服务治理、service mesh/服务网格、serverless/无服务化函数计算、k8s下的云原生、kernel内核虚拟化☆12Sep 13, 2024Updated last year
- sougou医学词库爬取☆13Nov 21, 2019Updated 6 years ago
- Super simple KeyValue store for python, backed by sqlite.☆13Apr 18, 2024Updated last year
- Deep Learning based autocomplete for search bars☆11Mar 18, 2022Updated 3 years ago
- Generate vulnerability reports using ChatGPT automatically.使用chatGPT自动生成漏洞报告。☆12Mar 11, 2023Updated 2 years ago
- Company website☆12Mar 30, 2015Updated 10 years ago
- 基于粒子群算法的自动组卷考试系统☆13Jan 5, 2018Updated 8 years ago
- Demo of using WASM to sandbox Plotly execution☆19Mar 30, 2025Updated 10 months ago
- Build a semantic search application with deep learning models.☆15Dec 3, 2024Updated last year
- Webshell Detection Based on Deep Learning☆12Jun 12, 2018Updated 7 years ago
- Attempts to prune yolo v3 tiny.☆10Dec 13, 2018Updated 7 years ago
- ☆11Oct 16, 2017Updated 8 years ago
- My implementation of 《Synthesizing Filamentary Structured Images with GANs》☆13Jun 1, 2018Updated 7 years ago
- ☆14Aug 19, 2018Updated 7 years ago
- 企业事件抽取☆13May 20, 2021Updated 4 years ago