simhash算法实现海量内容查重
☆14Apr 23, 2016Updated 10 years ago
Alternatives and similar repositories for check_file_system
Users that are interested in check_file_system are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用simhash算法,快速索引和查询大量文本简历☆21Dec 16, 2015Updated 10 years ago
- 使用Simhash对海量文本进行去重☆12Jun 2, 2018Updated 8 years ago
- semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language …☆27Jul 25, 2024Updated last year
- 海量中文文本快速查重☆18Dec 16, 2018Updated 7 years ago
- 基于gensim模块的中文句子相似度计算☆52Aug 1, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Price Spider is a Python tool to get price & promotion from JD, Tmall, Amazon, BeiBei☆10Jun 14, 2019Updated 7 years ago
- 通过微信接口抓取公众号文章☆13Mar 9, 2015Updated 11 years ago
- 一个全网爬的多线程爬虫☆18Dec 2, 2016Updated 9 years ago
- Half-edge 3D mesh in python/cython. Supports dynamic manipulation operators.☆19Jun 20, 2019Updated 6 years ago
- ☆20Aug 30, 2022Updated 3 years ago
- 敏感词检查及过滤扩展包,采用 DFA 算法☆18Jul 17, 2021Updated 4 years ago
- 推荐系统,web端展示基于django☆12Nov 1, 2017Updated 8 years ago
- Repository containing code for the paper "Learning to Learn to Disambiguate: Meta-Learning for Few-Shot Word Sense Disambiguation", publi…☆12Nov 12, 2020Updated 5 years ago
- 一个基于trie树的具有联想功能的文本编辑器。采用python和pyqt☆10Sep 7, 2016Updated 9 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ZEGO GoClass 是一款基于 ZEGO 音视频互动服务、即构互动白板服务(ZegoWhiteboard)以及 ZEGO 云端录制服务, 根据在线教育行业通用场景及需求研发出来的一套可供教育机构直接使用并开展运营的场景方案。☆10Aug 4, 2022Updated 3 years ago
- token bucket ratelimiter for nginx-lua/go/gin-middleware☆28Jul 5, 2023Updated 2 years ago
- A Knowledge Graph in the computer network field from scratch. It contains a website GUI with the Neo4j graph database, achieving entity e…☆15Nov 4, 2020Updated 5 years ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Nov 11, 2024Updated last year
- python3 pytorch>=0.4☆11Dec 25, 2019Updated 6 years ago
- online-exam-backend是一个在线考试系统的后端模块。基于Jersey+Spring实现的的restful服务,主要包括用户管理、在线考试,自动批卷、成绩管理、错题管理、留言板、试卷管理、题库管理、试题科目维护等功能。☆11Mar 19, 2021Updated 5 years ago
- A way to turn markdown into HTML and ebooks☆102Sep 26, 2013Updated 12 years ago
- 多源多分类图文数据监控平台设计与实现(Python、Django、爬虫、Echarts等技术)☆19Jul 4, 2018Updated 7 years ago
- 驾校在线考试模拟系统桌面端。科目一、科目四支持语音播报、错题解答等功能,技术栈:一次开发多端适配,web端,可生成desktop安装包,主要使用lectron-builder+vue全家桶以及element-ui☆14Aug 5, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A ctypes-based python module that provides access to Bob Jenkins' hash function.☆17Dec 3, 2009Updated 16 years ago
- 微服务的网关,包含oauth2授权、调用次数限制和服务路由☆13Jan 12, 2017Updated 9 years ago
- ☆16Mar 14, 2020Updated 6 years ago
- paascloud配套demo☆13May 19, 2018Updated 8 years ago
- SDK:移动端rtmp直播推送,类似于花椒、映客直播推送☆16Feb 22, 2016Updated 10 years ago
- some articles from gitchat VIP☆14Dec 19, 2021Updated 4 years ago
- Attempts to prune yolo v3 tiny.☆10Dec 13, 2018Updated 7 years ago
- Aspose.Words for Java Examples☆18Apr 22, 2024Updated 2 years ago
- Tools for auditing autocomplete on Google and Bing☆28Jun 11, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- update can run under py3, YEDDA☆14Dec 20, 2018Updated 7 years ago
- 进击的程序员之路上各种优秀资料、神器及框架汇总☆12Sep 18, 2018Updated 7 years ago
- 慕课商场高级课程代码☆13Feb 14, 2017Updated 9 years ago
- Spring-boot-2.x 脚手架,组件包括:Actuator,Swagger,异常处理,HikariCP☆16Aug 21, 2018Updated 7 years ago
- 企业事件抽取☆13May 20, 2021Updated 5 years ago
- 驾考宝典扒题工具☆13Jul 1, 2017Updated 8 years ago
- 【字节Lark】- 基础架构中一些规范:git流程规范、IDL描述文件规范和管理、中间件选型使用规范、RPC通信框架设计和规范、服务治理、service mesh/服务网格、serverless/无服务化函数计算、k8s下的云原生、kernel内核虚拟化☆12Apr 7, 2026Updated 2 months ago