THUCNews中文文本分类数据集的处理,该数据集包含84万篇新闻文档,总计14类;在数据集的基础上可以进行文本分类、词向量的训练等任务。
☆25Sep 10, 2020Updated 5 years ago
Alternatives and similar repositories for THUCNewsProject
Users that are interested in THUCNewsProject are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.☆12Feb 16, 2021Updated 5 years ago
- 基于ElasticSearch的海量文本检索系统☆20Jun 3, 2018Updated 7 years ago
- ☆11Oct 15, 2023Updated 2 years ago
- Tongji Data Structure Project - Social Network Links Prediction; 同济大学数据结构课程设计 - 社交网络预测☆11Sep 12, 2023Updated 2 years ago
- Official repository for ACM Multimedia'24 paper "MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube a…☆21Aug 11, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Python Scrapy spider that searches Google for a particular keyword and extracts all data from the SERP results. The spider will iterate t…☆18Feb 8, 2023Updated 3 years ago
- This is about the English test - cloze test which use the Google BERT model to predict the probable word.☆11May 18, 2021Updated 5 years ago
- Demonstration of how to use the Tor Browser and WebDriver in Python.☆14Aug 5, 2023Updated 2 years ago
- Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode…☆15May 5, 2024Updated 2 years ago
- 基于pytorch的关键点回归的两种方法:直接回归和heatmap热力图法☆11Nov 13, 2021Updated 4 years ago
- 同济大学数据结构与算法设计课程设计大作业:二叉树、社会关系网络☆18Nov 14, 2022Updated 3 years ago
- 该资源为恶意代码检测相关的论文或文章总结,包括作者撰写的恶意代码与机器学习、深度学习相关博客,希望对您有所帮助~☆15Jul 25, 2020Updated 5 years ago
- Preview markdown files in yazi with mdcat☆12Apr 24, 2025Updated last year
- Hackmageddon☆20Jan 22, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 针对Cnews数据集进行分类,使用了torchtext进行文本预处理☆11Sep 16, 2022Updated 3 years ago
- ☆18Jun 21, 2024Updated last year
- ResNLS: An Improved Model for Stock Price Forecasting.☆21Apr 13, 2026Updated last month
- ☆36Jun 14, 2019Updated 6 years ago
- a SJTU beamer template☆23Jun 4, 2012Updated 13 years ago
- 神经网络各种模型PyTorch实现☆43Dec 25, 2022Updated 3 years ago
- Official Code for the WWW'24 Paper: "Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models"☆26Apr 16, 2025Updated last year
- The official implement of CTRNet++.☆15Dec 30, 2024Updated last year
- Final project for "Navigating Data Structures" in the Introduction to Self Driving Cars Nanodegree☆10Jun 27, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 能够采集微博博主,博文,评论,分析博主信息,博文话题等,构建社交网络,同时对数据和网络进行分析的工具.☆24May 24, 2019Updated 7 years ago
- 身份证识别程序,包含切图程序,通过cv2切除矩形图片多余部分☆17Jan 26, 2019Updated 7 years ago
- C++ Nanodegree Program: https://www.udacity.com/course/c-plus-plus-nanodegree--nd213☆12Oct 14, 2021Updated 4 years ago
- ☆11Aug 27, 2020Updated 5 years ago
- [ACL2025] STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection☆48Oct 25, 2025Updated 7 months ago
- 本项目采用多模态特征融合和引入外部知识的方式来检测短视频谣言,创新性地引入了对比学习的方式实现了谣言的区分☆28Oct 17, 2023Updated 2 years ago
- ☆17May 10, 2023Updated 3 years ago
- Xfce4 HotCorner Panel Plugin☆16Aug 7, 2023Updated 2 years ago
- Recognizes 85.7% of the all the facial expressions in the Toronto Face Dataset☆10May 2, 2016Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 基于BERT模型的深度学习中文文本分类实现,包含大约20000条新闻的训练和测试集,包装有简单HTTP接口可供调用。☆24Jun 25, 2020Updated 5 years ago
- 文本聚类☆38Aug 4, 2021Updated 4 years ago
- Code for Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection☆41Apr 6, 2024Updated 2 years ago
- Halcon using and programming all in one.☆21May 25, 2025Updated last year
- Code listing for the paper 'Heterogeneity-aware Twitter Bot Detection with Relational Graph Transformers'. AAAI 2022.☆40Mar 1, 2022Updated 4 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- ☆37Jul 8, 2019Updated 6 years ago