Corpus data and lexicon collection: Chinese and English stop words, sentiment analysis, classification dictionaries (thesaurus), and sensitive-word lists (banned words, censored words).
☆35Feb 9, 2026Updated 4 months ago
Alternatives and similar repositories for data-corpus
Users who are interested in data-corpus are comparing it to the libraries listed below.
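- Disaster weather image recognition (dataset included)☆13Mar 19, 2024Updated 2 years ago
- Server side, iOS version: captures audio and video data, encodes it, and sends it to the client.☆27Dec 4, 2017Updated 8 years ago
- 抖助理: batch watermark removal; supports Kuaishou, Douyin, YouTube, Instagram (including Stories), Twitter, TK, Threads, Facebook, Vimeo, afreecatv, Tumblr, Triller, Likee, Twitch, Pinterest, Snapchat, Red…☆17Apr 20, 2024Updated 2 years ago
- Generates the large volumes of data needed by text error-correction models (such as Gector).☆14Jan 5, 2023Updated 3 years ago
- Unleash your creativity and build a world of art☆11Mar 13, 2023Updated 3 years ago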
- A powerful AI content creation platform with AI writing, image generation, video generation, PPT generation tools, and one-click multi-pl…☆76May 13, 2026Updated 3 weeks ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- CHisIEC An Information Extraction Corpus for Ancient Chinese History☆22Nov 25, 2025Updated 6 months ago
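- Textrank-based keyword extraction & summary extraction☆17Sep 15, 2023Updated 2 years ago
- Awesome-MCP-Scaffold is an out-of-the-box MCP server development scaffold that lets you: 🚀 Start in 5 minutes: a complete MCP server, from zero to running 🤖 Develop an MCP in 10 minutes: built-in prompts and examples, with MCP Server tools development completed from a single sentence in Cursor IDE …☆26Jul 11, 2025Updated 10 months ago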
- capture screen(full /area) and mic, encode as h264 and aac, transfer as rtmp☆11Jan 21, 2017Updated 9 years ago
- Turn Dify API into OpenAI API schema☆17Aug 16, 2024Updated last year
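- Natural language processing: Chinese text classification (using spam SMS detection as an example)☆24Jun 4, 2020Updated 6 years ago
- Deep learning-based license plate recognition system☆28Mar 31, 2023Updated 3 years ago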
- Chinese character variant converter.☆23Oct 17, 2025Updated 7 months ago
- Decoding h264 rtsp stream with libavformat☆15Nov 19, 2015Updated 10 years ago
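- Ezviz SDK, a more powerful development kit that supports live preview, playback, network provisioning, intercom, device control, oAuth authorization, and more☆14May 29, 2026Updated last week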
- Yet Another Chinese Spelling Check Dataset (YACSC)☆21Oct 25, 2023Updated 2 years ago
- Source code of Otstar's Space, a personal homepage.☆11May 17, 2026Updated 3 weeks ago
- Trimmed and cleaned up Samsung device skins for the Android Emulator☆16Mar 9, 2022Updated 4 years ago
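- This paper proposes a safety evaluation benchmark for Chinese LLMs based on ERNIE Bot (文心一言), covering 8 typical safety scenarios and 6 types of instruction attack. It also proposes a safety evaluation framework and process that uses test prompts written by hand or collected from open-source data, and combines human intervention with the strong evaluation ability of LLMs acting as "co-evaluators".☆34Sep 1, 2023Updated 2 years ago
- Bank card number validation library, Python version☆23Jul 28, 2019Updated 6 years ago
- ChatAdmin is the backend API service for ChatFlow and ChatStudio☆17Oct 18, 2024Updated last year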
- ☆13Mar 13, 2023Updated 3 years ago
- Much simpler client for Stable Diffusion WebUI☆15Feb 10, 2025Updated last year
- Lenovo yoga 710 i7-7500U BigSur OpenCore EFI☆13Mar 19, 2021Updated 5 years ago
- Trojan-Go is a Golang version of Trojan, which is an unidentifiable mechanism that helps you bypass GFW.☆10Mar 22, 2020Updated 6 years ago
- A mihomo menu bar control panel built for macOS, bringing together node, rule, connection, log, and core management.☆122May 4, 2026Updated last month
- The MCP gateway is a reverse proxy server that forwards requests from clients to the MCP server or uses all MCP servers under the gateway…☆30Jun 1, 2026Updated last week
- An animated résumé. Forks welcome ➡️☆16Dec 9, 2022Updated 3 years ago
- Deep learning-based calligraphy font recognition☆54May 2, 2020Updated 6 years ago
- baichuan and baichuan2 finetuning and alpaca finetuning☆33Mar 10, 2025Updated last year
- near-synonym, an LLM-based toolkit for Chinese antonyms/synonyms. It can also compute word similarity, sentence similarity, text similarity, and so on.☆31Apr 29, 2025Updated last year
- High-performance text Tokenizer library☆31Feb 2, 2024Updated 2 years ago
- RoBERTa + BiLSTM + CRF for Chinese NER Task☆35Jul 5, 2021Updated 4 years ago
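- 🎨 Provides Stable Diffusion AI image generation, with support for a WeChat Mini Program and a web admin backend☆19Sep 8, 2024Updated last year
- Uses the clash core to batch-test Netflix unlock status at high speed.☆19Apr 19, 2024Updated 2 years ago
- Online installation of the iOS Shadowrocket ipa☆19Feb 16, 2021Updated 5 years ago
- Graduation project: pedestrian detection system, pyqt + opencv☆52May 23, 2021Updated 5 years ago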