THUCNews中文文本分类数据集的处理,该数据集包含84万篇新闻文档,总计14类;在数据集的基础上可以进行文本分类、词向量的训练等任务。
☆23Sep 10, 2020Updated 5 years ago
Alternatives and similar repositories for THUCNewsProject
Users that are interested in THUCNewsProject are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- THUCNews中文文本分类数据集,该数据集包含84万篇新闻文档,总计14类;在该模型的基础上测试多个版本bert分类效果。☆71Feb 2, 2021Updated 5 years ago
- A new release of Chinese sexism dataset and lexicon☆14May 23, 2023Updated 2 years ago
- Official repository for ACM Multimedia'24 paper "MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube a…☆21Aug 11, 2024Updated last year
- Python Scrapy spider that searches Google for a particular keyword and extracts all data from the SERP results. The spider will iterate t…☆18Feb 8, 2023Updated 3 years ago
- Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode…☆15May 5, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Preview markdown files in yazi with mdcat☆10Apr 24, 2025Updated 11 months ago
- 该资源为恶意代码检测相关的论文或文章总结,包括作者撰写的恶意代码与机器学习、深度学习相关博客,希望对您有所帮助~☆15Jul 25, 2020Updated 5 years ago
- 针对Cnews数据集进行分类,使用了torchtext进行文本预处理☆11Sep 16, 2022Updated 3 years ago
- ☆18Jun 21, 2024Updated last year
- flask注册登录简单演示☆18Aug 16, 2018Updated 7 years ago
- ☆37Jun 14, 2019Updated 6 years ago
- 神经网络各种模型PyTorch实现☆43Dec 25, 2022Updated 3 years ago
- Official Code for the WWW'24 Paper: "Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models"☆23Apr 16, 2025Updated 11 months ago
- Final project for "Navigating Data Structures" in the Introduction to Self Driving Cars Nanodegree☆11Jun 27, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Open-Sora: Democratizing Efficient Video Production for All☆19Nov 7, 2024Updated last year
- 能够采集微博博主,博文,评论,分析博主信息,博文话题等,构建社交网络,同时对数据和网络进行分析的工具.☆24May 24, 2019Updated 6 years ago
- [ACL2025] STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection☆46Oct 25, 2025Updated 5 months ago
- 本项目采用多模态特征融合和引入外部知识的方式来检测短视频谣言,创新性地引入了对比学习的方式实现了谣言的区分☆26Oct 17, 2023Updated 2 years ago
- ☆17May 10, 2023Updated 2 years ago
- ☆25May 7, 2022Updated 3 years ago
- Xfce4 HotCorner Panel Plugin☆15Aug 7, 2023Updated 2 years ago
- Recognizes 85.7% of the all the facial expressions in the Toronto Face Dataset☆10May 2, 2016Updated 9 years ago
- 基于BERT模型的深度学习中文文本分类实现,包含大约20000条新闻的训练和测试集,包装有简单HTTP接口可供调用。☆24Jun 25, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Demo for MSCP which will be appeared on TGRS☆13May 7, 2019Updated 6 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- How to use XGBoost for multi-step time series forecasting☆44Nov 2, 2022Updated 3 years ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- ☆12Nov 5, 2024Updated last year
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆10May 6, 2024Updated last year
- ☆13Mar 5, 2025Updated last year
- Code and data for automatic paraphrase dataset augmentation.☆11Mar 8, 2021Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆31Jan 19, 2026Updated 2 months ago
- 百度网盘AI大赛——图像处理挑战赛:文档图像摩尔纹消除第2名方案☆43Nov 28, 2023Updated 2 years ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- ☆11Nov 5, 2024Updated last year
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- ☆55Mar 24, 2022Updated 4 years ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year