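Processing of the THUCNews Chinese text classification dataset, which contains 840,000 news documents in 14 categories; the dataset can be used for tasks such as text classification and word-vector training.
☆23 · Sep 10, 2020 · Updated 5 years ago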
Alternatives and similar repositories for THUCNewsProject
Users that are interested in THUCNewsProject are comparing it to the libraries listed below.
- THUCNews Chinese text classification dataset, containing 840,000 news documents in 14 categories; tests the classification performance of several BERT versions on this basis. ☆70 · Feb 2, 2021 · Updated 5 years ago
- A new release of Chinese sexism dataset and lexicon ☆15 · May 23, 2023 · Updated 2 years ago
- Multi-model Chinese cnews news text classification ☆59 · Mar 25, 2020 · Updated 6 years ago
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model. ☆12 · Feb 16, 2021 · Updated 5 years ago
- Official repository for ACM Multimedia'24 paper "MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube a… ☆21 · Aug 11, 2024 · Updated last year
- Python Scrapy spider that searches Google for a particular keyword and extracts all data from the SERP results. The spider will iterate t… ☆18 · Feb 8, 2023 · Updated 3 years ago
- An English cloze test that uses the Google BERT model to predict the most probable word. ☆11 · May 18, 2021 · Updated 4 years ago
- Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode… ☆15 · May 5, 2024 · Updated last year
- Two PyTorch-based methods for keypoint regression: direct regression and heatmaps ☆11 · Nov 13, 2021 · Updated 4 years ago
- A collection of summaries of papers and articles on malicious code detection, including the author's blog posts on malicious code with machine learning and deep learning; hopefully it helps. ☆15 · Jul 25, 2020 · Updated 5 years ago
- Multi-step time series forecasting with LSTM and ANN networks. Machine learning algorithms usually forecast time series one step ahead; this code extends that to the multi-step case, mainly by changing how the data is constructed. LSTM and ANN are used to predict the time series. In … ☆16 · Sep 9, 2020 · Updated 5 years ago
- Hackmageddon ☆20 · Jan 22, 2021 · Updated 5 years ago
- Classification on the Cnews dataset, using torchtext for text preprocessing ☆11 · Sep 16, 2022 · Updated 3 years ago
- ☆18 · Jun 21, 2024 · Updated last year
- Official Code for the WWW'24 Paper: "Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models" ☆25 · Apr 16, 2025 · Updated 11 months ago
- Deep-learning-based malicious code detection ☆19 · Jun 12, 2020 · Updated 5 years ago
- A tool that collects Weibo bloggers, posts, and comments, analyzes blogger information and post topics, builds social networks, and analyzes both the data and the networks. ☆24 · May 24, 2019 · Updated 6 years ago
- [ACL2025] STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection ☆46 · Oct 25, 2025 · Updated 5 months ago
- Popular method ARIMA for outlier detection purposes ☆28 · Jun 25, 2024 · Updated last year
- This project detects short-video rumors through multimodal feature fusion and external knowledge, and takes the novel step of using contrastive learning to distinguish rumors ☆27 · Oct 17, 2023 · Updated 2 years ago
- ☆25 · May 7, 2022 · Updated 3 years ago
- This repository is the team ETS-Lab's solution towards KDD Cup 2022. ☆34 · Jun 11, 2024 · Updated last year
- Code for Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection ☆40 · Apr 6, 2024 · Updated 2 years ago
- Code listing for the paper 'Heterogeneity-aware Twitter Bot Detection with Relational Graph Transformers'. AAAI 2022. ☆40 · Mar 1, 2022 · Updated 4 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 · Oct 11, 2024 · Updated last year
- Deploys YOLOV3 with OpenCV to detect QR codes, with both C++ and Python versions; runs with the opencv library as its only dependency ☆26 · Dec 12, 2021 · Updated 4 years ago
- Official repository for EMNLP'24 paper "ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Pertu… ☆46 · Oct 3, 2024 · Updated last year
- ☆37 · Jul 8, 2019 · Updated 6 years ago
- ☆12 · Nov 5, 2024 · Updated last year
- Chinese financial large-model evaluation benchmark: six categories, twenty-five tasks, graded evaluation; domestic models achieve grade A ☆10 · May 6, 2024 · Updated last year
- Knowledge Graph based Question Answering benchmark. ☆10 · Feb 1, 2020 · Updated 6 years ago
- ☆13 · Mar 5, 2025 · Updated last year
- Code and data for automatic paraphrase dataset augmentation. ☆11 · Mar 8, 2021 · Updated 5 years ago
- An open-source fund valuation framework ☆25 · Mar 30, 2026 · Updated 2 weeks ago
- ☆11 · Nov 5, 2024 · Updated last year
- LGEB: Benchmark of Language Generation Evaluation ☆16 · Oct 21, 2022 · Updated 3 years ago
- ☆55 · Mar 24, 2022 · Updated 4 years ago
- [CVPR2024] Learning from Synthetic Human Group Activities ☆14 · Feb 24, 2025 · Updated last year
- This repository houses the datasets/resources used in paper "ChatGPT Informed Graph Neural Network for Stock Movement Prediction". Dive i… ☆50 · Sep 18, 2023 · Updated 2 years ago