中文恶意网页检测数据 集与检测方法
☆21Mar 4, 2025Updated last year
Alternatives and similar repositories for Chinese_Malicious_Web_Pages_Dataset_And_Detection
Users that are interested in Chinese_Malicious_Web_Pages_Dataset_And_Detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本列表收录互联网上常见的恶意网站网址。This list contains URLs of malicious websites commonly found on the Internet.☆46Jan 21, 2018Updated 8 years ago
- 本科毕设:URL恶意性检测,基于字符串本身进行特征提取, 基于sklearn库的机器学习模型进行分类(附实验数据于data文件夹)☆62May 14, 2020Updated 5 years ago
- 机器学习检测恶意URL改进版☆28Aug 20, 2020Updated 5 years ago
- 将报表数据转换格式并入库时遇到许多重复性工作,于是用Python写了一些脚本进行自动化处理,并用PySide2做了GUI界面,做成了一个工具合集☆10Sep 29, 2021Updated 4 years ago
- 77,370条敏感文本和22,823个敏感词的高质量数据集,并进行分类☆17Mar 18, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于tensorflow的中文文本分类(复 旦中文语料)☆21Nov 3, 2020Updated 5 years ago
- Datacon2023 邮件安全赛道 赛题1 新型钓鱼邮件的检测 示例数据集☆30Nov 16, 2023Updated 2 years ago
- 中文文本摘要,基于pytorch,采用LCSTS数据集☆21Nov 11, 2021Updated 4 years ago
- 英文文献的《中国图书馆分类法》自动标注小程序☆12Oct 29, 2024Updated last year
- Malicious Web Sites Detection using Suspicious URL☆76Oct 2, 2020Updated 5 years ago
- ☆10Jul 23, 2019Updated 6 years ago
- An evaluation bentchmark for classical Chinese☆19Dec 13, 2023Updated 2 years ago
- Klara docker compose☆11May 19, 2020Updated 5 years ago
- Insert heart-shaped Toggle Switch within Streamlit apps! 🧡☆11Feb 27, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 【辣鸡采集,采集世界上所有辣鸡数据 欢迎大家来采集】,[collect web][ai collect] [auto collect] [爬虫数据][采集数据][在线采集][web采集][collect][spider],go全开源 最新算法采集 ,全智能采集 不用写规则 …☆31Aug 22, 2024Updated last year
- 将Python库ttkbootstrap的ttkcreator建造器汉化,并修改菜单和加入工具提示☆11Apr 17, 2022Updated 4 years ago
- reid可视化重识别系统☆10Dec 3, 2023Updated 2 years ago
- 毕业设计:互联网新闻热点抽取系统☆10May 21, 2022Updated 3 years ago
- y-trainerY-Trainer 是一个LLM模型微调训练框架。 📊 核心优势: 📉 精准对抗过拟合: 专门优化,有效解决SFT中的过拟合难题。 🧩 突破遗忘瓶颈: 无需依赖通用语料,即可卓越地保留模型的泛化能力,守住核心竞争力的同时实现专项提升!🏆☆43Mar 3, 2026Updated last month
- A collection of datafiles created from the library of congress open data dump☆20May 19, 2017Updated 8 years ago
- Cybersecurity Ontology (CyberOnto) and Situational Awareness (CyberSA) help teamwork in Cyber Incident Responses, Control, Containment, a…☆10Sep 15, 2022Updated 3 years ago
- 舆情分析系统后端☆11Jun 21, 2021Updated 4 years ago
- A large corpus of Chinese fixed phrases and idioms scraped from a reputable educational website (30310 instances). 一个大型的中文成语及俗语语料库,内含3031…☆13Oct 29, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Aug 14, 2019Updated 6 years ago
- web端检测url是否可访问☆33Jan 4, 2023Updated 3 years ago
- Threat Detection Rules (Snort/Sigma/Yara)☆14Jan 23, 2024Updated 2 years ago
- 爬取3000+谣言新闻,并对新闻信息进行建模、分类与预测。☆11Sep 12, 2018Updated 7 years ago
- “达观杯”长文本智能处理挑战赛。达观数据提供了一批长文本数据和分类信息,希望选手动用自己的智慧,结合当下最先进的NLP和人工智能技术,深入分析文本内在结构和语义信息,构建文本分类模型,实现精准分类。☆10Jul 20, 2018Updated 7 years ago
- TensorFlow: learn and practice☆11Aug 30, 2018Updated 7 years ago
- ☆23Jun 2, 2019Updated 6 years ago
- Scraped reviews from OpenRice for sentiment analysis. Formatted to use with BERT.☆11Apr 9, 2020Updated 6 years ago
- Script from the paper generating encrypted network. Dataset☆11Sep 1, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Data Driven AI-based Honeypot☆25Apr 15, 2026Updated 2 weeks ago
- ☆11Nov 21, 2024Updated last year
- The runner-up solution of AICITY Challenge Track2 (Vehicle Re-Identification) at CVPR 2021 Workshop.☆21May 3, 2022Updated 3 years ago
- 早期的计算机使用7位的ASCII编码,为了处理汉字,程序员设计了用于简体中文的GB2312和用于繁体中文的big5。 GB2312(1980年)一共收录了7445个字符,包括6763个汉字和682个其它符号。汉字区的内码范围高字节从B0-F7,低字节从A1-FE,占用的码…☆10Sep 10, 2017Updated 8 years ago
- We apply from rule-based approach to BERT for a sentiment analysis task on financial texts.☆15May 13, 2022Updated 3 years ago
- A curated list of resources dedicated to word segmentation☆12Jan 9, 2019Updated 7 years ago
- Codes of the 3rd place of Track 1, AI City Challenge 2022☆19Jul 24, 2022Updated 3 years ago