Chinese Corpus, Daily Auto-Updated Edition: Corpus Files
☆172 · Dec 19, 2020 · Updated 5 years ago
Alternatives and similar repositories for data
Users that are interested in data are comparing it to the libraries listed below
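- Chinese Corpus, Daily Auto-Updated Edition: crawler code ☆15 · Aug 8, 2020 · Updated 5 years ago
- To let an AI learn to play decision-making games such as Don't Starve Together on its own, it needs to recognize images that appear in the game, so this project trains a new YOLO model for that recognition ☆17 · Jan 7, 2022 · Updated 4 years ago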
- MoRPA Studio based on [RPAStudio](https://github.com/rpa-ai/RPAStudio) ☆10 · Dec 8, 2022 · Updated 3 years ago
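- Chinese-language lecture notes on data science and artificial intelligence ☆12 · Feb 23, 2026 · Updated last week
- ☆12 · Apr 10, 2023 · Updated 2 years ago
- Comic speech-bubble recognition using object detection (comic, textboxs) ☆12 · Apr 14, 2019 · Updated 6 years ago
- English or Chinese GPT2Dialog model from GPT2-chitchat ☆12 · Feb 23, 2020 · Updated 6 years ago
- Sample crawlers for scraping various data sources (Baidu Baike, Zhihu, Weibo, Jianshu, Sogou lexicon), usable for collecting NLP corpora ☆13 · Jul 7, 2025 · Updated 7 months ago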
- Yuren 13B is an information synthesis large language model that has been continuously trained based on Llama 2 13B, which builds upon the… ☆15 · Sep 25, 2023 · Updated 2 years ago
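- Collected essays advising against majoring in biology, chemistry, environmental science, and materials science ☆19 · Mar 19, 2022 · Updated 3 years ago
- Generates illustrated cards from poems, song lyrics, and aphorisms ☆16 · Feb 7, 2020 · Updated 6 years ago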
- Extract relationships between cyber security entities within unstructured text ☆24 · Sep 28, 2018 · Updated 7 years ago
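- V2EX's electrocardiogram (online user count over time) ☆17 · Mar 28, 2016 · Updated 9 years ago
- A collection of Chinese NLP resources, corpora, related frameworks, and articles. ☆27 · May 20, 2022 · Updated 3 years ago
- Zhihu blacklist ☆14 · Aug 14, 2017 · Updated 8 years ago
- We are a group of people around 40 years old in the IT industry with shared worries; let's fight together! ☆25 · Jul 13, 2021 · Updated 4 years ago
- pdf-js-inject: injects JS code into PDF files, and can also inject XSS payloads into PDF files ☆31 · Sep 8, 2024 · Updated last year
- A tool for reposting videos from NICONICO to BiliBili ☆30 · Mar 16, 2020 · Updated 5 years ago
- 苍穹: Tieba check-in assistant ☆10 · Oct 3, 2014 · Updated 11 years ago
- A multi-page front-end e-commerce website project ☆12 · Aug 18, 2019 · Updated 6 years ago
- ☆10 · Oct 29, 2023 · Updated 2 years ago
- Corpus of Chinese law-related texts ☆40 · Sep 2, 2023 · Updated 2 years ago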
- Trending projects & awesome papers about data-centric LLM studies. ☆40 · May 20, 2025 · Updated 9 months ago
- Midi2PLAY is an application that helps the process of converting MIDI files (.mid) making them compatible with the syntax accepted by the… ☆10 · Dec 30, 2021 · Updated 4 years ago
- ☆14 · Jul 23, 2020 · Updated 5 years ago
- Frenet Corridor Planner: a highly efficient and noise-resilient optimal path planner. ☆12 · Sep 2, 2025 · Updated 5 months ago
- ☆15 · Oct 24, 2023 · Updated 2 years ago
- Text generation (Word2Vec + RNN/LSTM) ☆36 · Jul 1, 2018 · Updated 7 years ago
- A data annotation tool for sequence labeling tasks (word segmentation, NER) ☆11 · Jun 3, 2020 · Updated 5 years ago
- A quick markdown editor. ☆13 · Oct 11, 2023 · Updated 2 years ago
- Scrapes Baidu Index data ☆12 · Dec 8, 2022 · Updated 3 years ago
- Silk Road will be the dataset zoo for Luotuo (骆驼). Luotuo is an open-source Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子… ☆40 · Nov 5, 2023 · Updated 2 years ago
- OpenPatent: uses large models to assist in generating patent application documents ☆21 · Mar 30, 2025 · Updated 11 months ago
- Reverse shell management tool ☆11 · Feb 10, 2020 · Updated 6 years ago
- ☆13 · Jan 6, 2023 · Updated 3 years ago
- Rule-based identification for yakit ☆14 · Apr 17, 2025 · Updated 10 months ago
- ☆10 · Jun 15, 2024 · Updated last year
- Builds a seq2seq model with PaddleNLP to generate text2sparql output and parse some data from Sina Finance. ☆11 · Jul 16, 2022 · Updated 3 years ago
- Dialogue corpus from 120 Chinese web novels ☆16 · Feb 14, 2017 · Updated 9 years ago