Corpus data and word list collection: Chinese and English stop words, sentiment analysis, classification dictionaries (thesaurus), and sensitive word lists (banned and censored words).
☆35Feb 9, 2026Updated 4 months ago
Alternatives and similar repositories for data-corpus
Users who are interested in data-corpus are comparing it to the libraries listed below.
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- Template Engine Benchmark Test☆16May 12, 2014Updated 12 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- CHisIEC: An Information Extraction Corpus for Ancient Chinese History☆23Nov 25, 2025Updated 7 months ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆66Nov 5, 2025Updated 8 months ago
- Turn Dify API into OpenAI API schema☆17Aug 16, 2024Updated last year
- Chinese text classification for natural language processing (using spam SMS detection as an example)☆24Jun 4, 2020Updated 6 years ago
- Chinese character variant converter.☆23Oct 17, 2025Updated 8 months ago
- Decoding h264 rtsp stream with libavformat☆15Nov 19, 2015Updated 10 years ago
- Yet Another Chinese Spelling Check Dataset (YACSC)☆22Oct 25, 2023Updated 2 years ago
- Source code for Otstar's Space, a personal homepage.☆11May 17, 2026Updated last month
- Hikvision Android SDK for secondary development☆16Mar 31, 2020Updated 6 years ago
- Simple H264 decoding plugin for Flutter☆17Aug 18, 2021Updated 4 years ago
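- This paper proposes a safety evaluation benchmark for Chinese LLMs based on ERNIE Bot (文心一言), covering 8 typical safety scenarios and 6 types of instruction attacks. It also proposes a safety evaluation framework and process that uses test prompts written by hand and collected from open-source data, combining human intervention with the strong evaluation ability of LLMs acting as a "co-evaluator".☆34Sep 1, 2023Updated 2 years ago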
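- ChatAdmin is the backend API service for ChatFlow and ChatStudio☆17Oct 18, 2024Updated last year
- Screen capture tool in Java for capturing a specific window on the desktop☆11Jan 23, 2020Updated 6 years ago
- Enterprise card-issuing (发卡) source code: card-issuing system, card-issuing source code, card-issuing program, PHP card-issuing source code, Knight (骑士) enterprise edition☆17Apr 20, 2026Updated 2 months ago
- Tuiwen (推文) tool; Bilibili uploader (烂桃侠) page: https://space.bilibili.com/445842828☆17Apr 12, 2024Updated 2 years ago
- Legend (传奇), the game I miss the most!☆15Mar 30, 2017Updated 9 years ago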
- Trojan-Go is a Golang version of Trojan, which is an unidentifiable mechanism that helps you bypass GFW.☆10Mar 22, 2020Updated 6 years ago
- A mihomo menu bar control panel built for macOS, bringing together node, rule, connection, log, and core management.☆123May 4, 2026Updated 2 months ago
- A well-designed cross-platform ChatGPT UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT app with one click.☆14Updated this week
- The MCP gateway is a reverse proxy server that forwards requests from clients to the MCP server or uses all MCP servers under the gateway…☆31Jun 1, 2026Updated last month
- An animated résumé. Forks welcome ➡️☆16Dec 9, 2022Updated 3 years ago
- chatglm_rlhf_finetuning☆30Oct 10, 2023Updated 2 years ago
- A WeChat mini program supporting reading notes, time planning, an online community, and leisure and entertainment, with cloud development (云开发) as the backend.☆12Jun 27, 2019Updated 7 years ago
- near-synonym, an LLM-based toolkit for Chinese antonyms and synonyms. It can also compute word, sentence, and text similarity.☆31Apr 29, 2025Updated last year
- Written in Swift, just a test for fun☆22Sep 23, 2019Updated 6 years ago
- High-performance text tokenizer library☆31Feb 2, 2024Updated 2 years ago
- Baidu Baike dataset of 5 million entries☆50Dec 1, 2023Updated 2 years ago
- 🎨 Provides Stable Diffusion AI image generation, with support for a WeChat mini program and a web admin console☆19Sep 8, 2024Updated last year
- Uses the clash core to batch-test Netflix unlock status at high speed.☆20Apr 19, 2024Updated 2 years ago
- ☆17Jan 15, 2022Updated 4 years ago
- For video and audio streaming☆27Aug 31, 2020Updated 5 years ago
- simple☆22Nov 1, 2017Updated 8 years ago
- kdd2017 travel time competition rank 28/3574☆30Jun 2, 2017Updated 9 years ago
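- BERT-based Chinese entity linking☆30Nov 24, 2021Updated 4 years ago
- Captures video on Linux, encodes it to H264, and sends it to Windows, where it is decoded with FFMPEG into a YUV420P stream and finally converted to RGB32 with OpenCV for real-time display☆25Mar 17, 2017Updated 9 years ago
- Shows a random beautiful image in each new tab and provides practical features such as a calendar, weather, countdown days, to-do list, focus mode, white noise, and quick links☆15May 18, 2026Updated last month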