lrcUnlimited/check_file_system

Duplicate detection for massive amounts of content, implemented with the simhash algorithm

☆14

Alternatives and similar repositories for check_file_system

Users that are interested in check_file_system are comparing it to the libraries listed below.

likaiguo / simhashpy
Uses the simhash algorithm to quickly index and query large numbers of text résumés
☆21 · Dec 16, 2015 · Updated 10 years ago
15810856129 / Simhash
Deduplicates massive amounts of text using Simhash
☆12 · Jun 2, 2018 · Updated 8 years ago
guanrongjia / chinese-article-fast-compare
Fast duplicate detection for massive amounts of Chinese text
☆18 · Dec 16, 2018 · Updated 7 years ago
zhenghaishu / MachineLearning
☆10 · Apr 8, 2018 · Updated 8 years ago
talent518 / tensorflow
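A collection of completed TensorFlow examples that train an image classification model and use it for image recognition; supports console mode and browser/server (B/S) mode.
☆12 · Jul 31, 2017 · Updated 8 years ago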
ZhangYiBo513 / Simhash-
Deduplicates massive numbers of articles (long texts), based on the simhash algorithm Google uses for large-scale web page deduplication.
☆11 · Dec 8, 2022 · Updated 3 years ago
UIC-Liu-Lab / DGA
[EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge
☆21 · Feb 12, 2023 · Updated 3 years ago
OkkBtc / Dy_Device_Register
Douyin 9.1.1 (other versions untested): displays as plaintext the encrypted part of device_register interface traffic captured with Fiddler; hooks XG
☆17 · Jul 3, 2020 · Updated 6 years ago
seomoz / simhash-db-py
Python API for Various DB-Backed Simhash Clusters
☆64 · Mar 16, 2017 · Updated 9 years ago
fanyong920 / crawlItem
A Google Chrome extension for scraping Taobao and Tmall web pages
☆20 · Jun 4, 2020 · Updated 6 years ago
Nithin-Holla / MetaWSD
Repository containing code for the paper "Learning to Learn to Disambiguate: Meta-Learning for Few-Shot Word Sense Disambiguation", publi…
☆12 · Nov 12, 2020 · Updated 5 years ago
YinHeng89 / docker-dashboard
docker-dashboard
☆23 · Jul 10, 2026 · Updated 2 weeks ago
LaraQianYang / Ouroboros
Ouroboros: On Accelerating Training of Transformer-Based Language Models
☆10 · Nov 7, 2019 · Updated 6 years ago
zegoim / go-class
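ZEGO GoClass is a scenario solution built on ZEGO's interactive audio/video service, ZEGO's interactive whiteboard service (ZegoWhiteboard), and ZEGO's cloud recording service. It was developed around the common scenarios and needs of the online education industry, and educational institutions can use it directly to run their operations.
☆10 · Aug 4, 2022 · Updated 3 years ago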
Chale-project / driver-cultivate-exam-deskapp
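Desktop client for a driving-school online exam simulation system. Subject 1 and Subject 4 support voice playback, explanations of wrong answers, and other features. Tech stack: develop once and adapt to multiple platforms; a web client that can also generate desktop installers; mainly uses electron-builder, the Vue ecosystem, and element-ui
☆14 · Aug 5, 2020 · Updated 5 years ago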
hq20051252 / sogouWeixin
Crawls Sogou WeChat official accounts
☆15 · Mar 9, 2015 · Updated 11 years ago
wweggplant / admin-monitor
Front-end demo of a monitoring system admin console, using vue, element-ui, echarts, and mqtt
☆13 · Jan 29, 2024 · Updated 2 years ago
Flaykz / HuaweiToGPX
Export HiTrack Huawei file from Watch GT to GPX file
☆20 · Apr 29, 2026 · Updated 3 months ago
liuyijiang1994 / LatticeLSTM
python3 pytorch>=0.4
☆11 · Dec 25, 2019 · Updated 6 years ago
dbamman / ACL2019-literary-events
☆16 · Feb 5, 2022 · Updated 4 years ago
axiaoxin-com / ratelimiter
token bucket ratelimiter for nginx-lua/go/gin-middleware
☆28 · Jul 5, 2023 · Updated 3 years ago
GuohuaZhuang / deduplication-detecting
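Document deduplication intended to solve the problem of semantically duplicate documents in search engines; the method is a semantic fingerprint algorithm based on multiple hashing.
☆11 · Aug 17, 2013 · Updated 12 years ago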
zyymax / text-similarity
Computes the similarity of Chinese texts using TF feature vectors and simhash fingerprints
☆216 · Aug 12, 2016 · Updated 9 years ago
ostarsier / api-gateway
A gateway for microservices, including OAuth2 authorization, call-count limiting, and service routing
☆13 · Jan 12, 2017 · Updated 9 years ago
yli1 / CLCL
☆16 · Mar 14, 2020 · Updated 6 years ago
YijunRan / Opinion-leaders-mining
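This work proposes a method for mining opinion leaders in QQ groups based on reply relationships. The method first builds a lexicon of response words, then uses the Aho-Corasick algorithm to match response words in the chat text and construct a network of user reply relationships, and finally applies important-node identification methods from social network analysis to discover the opinion leaders. The method achieves fairly high accuracy in discovering opinion leaders in QQ groups…
☆21 · Jul 7, 2016 · Updated 10 years ago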
zif2016 / EasyPublisher
SDK: RTMP live-stream publishing for mobile, similar to the live-stream publishing in Huajiao and Inke
☆16 · Feb 22, 2016 · Updated 10 years ago
gaowenzhen / ecs.aliyun.com
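This example is a study of the Alibaba Cloud server management console (hereafter "the console"). The console application is a combination of several Angular projects, and it opens different projects by redirecting to different second-level domains. For example, the current project is ecs.aliyun.com, and its default route is #home. Of course, this is for learning purposes and not meant to provoke Alibaba, because I saw that its front-end skin…
☆15 · Feb 9, 2017 · Updated 9 years ago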
winglight / algo-trader-ib
☆25 · Updated this week
Aspose / Aspose.Words-for-Java
Aspose.Words for Java Examples
☆18 · Apr 22, 2024 · Updated 2 years ago
manishravula / yolov3_tiny_pruned
Attempts to prune yolo v3 tiny.
☆10 · Dec 13, 2018 · Updated 7 years ago
johnzhaoxiao / YEDDA
update can run under py3, YEDDA
☆14 · Dec 20, 2018 · Updated 7 years ago
gitdlf / Word-To-Html-Demo
Converts Word documents to HTML using xdocreport and poi: xdocreport for Word 2007+, poi for Word 2003-2007
☆12 · May 27, 2018 · Updated 8 years ago
jorrellz / learn-repository
A roundup of excellent resources, power tools, and frameworks for the advancing programmer's journey
☆12 · Sep 18, 2018 · Updated 7 years ago
functail / GitChat
some articles from gitchat VIP
☆14 · Dec 19, 2021 · Updated 4 years ago
mao-yuwei / paper_download
download html paper to word format
☆16 · Jun 30, 2026 · Updated 3 weeks ago
MyHerux / spring-boot-2.x-scaffold
Spring-boot-2.x scaffold; components include Actuator, Swagger, exception handling, and HikariCP
☆16 · Aug 21, 2018 · Updated 7 years ago
plter / JiakaoParser
A tool for scraping exam questions from Jiakao Baodian
☆14 · Jul 1, 2017 · Updated 9 years ago
TuHuiHub / ems
Academic affairs course scheduling system
☆10 · Mar 27, 2019 · Updated 7 years ago