深入探索大型语言模型(LLM)的世界,本项目汇集了跨越五个关键维度的代表性文本数据集——预训练语料库、微调指令数据集、偏好数据集、评估数据集、传统NLP数据集及多模态数据集。我们致力于为研究者和开发者提供最全面的资源,以推动人工智能技术的发展和应用。
☆20Apr 26, 2024Updated 2 years ago
Alternatives and similar repositories for AwesomeLLMsDatasets
Users that are interested in AwesomeLLMsDatasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 3 months ago
- ☆11Jun 11, 2024Updated last year
- The code and data for the paper "Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation"☆13Oct 8, 2025Updated 7 months ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Build LLM Application with Local Documents☆19Jun 13, 2025Updated 10 months ago
- Open-source code for GEAR☆13Dec 3, 2025Updated 5 months ago
- 校园音乐征集投票系统 A system for electing annual school music☆10Apr 18, 2026Updated 3 weeks ago
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening☆10Dec 18, 2025Updated 4 months ago
- 从图片中读取EXIF信息,提取拍摄时间和GPS坐标,并使用这些数据获取详细的地址信息。然后将这些信息添加到图片上,并保存带有地理位置标签和时间戳的新图片。☆13May 8, 2025Updated last year
- Taurix OS kernel. Taurix 系统内核,操作系统原理实(xjb)践(写)☆12Dec 20, 2020Updated 5 years ago
- 量化交易网站,软工三大作业迭代三,团队项目☆11Mar 8, 2018Updated 8 years ago
- NewsApp包含客户端源码、服务端源码、数据库文件。 基于Miscrosoft人工智能项目ProjectOxford中的Recognition Emotion做的, 主要是基于用户的面部表情来推送不同类别的新闻。 Emotion API可以参考:https://www.p…☆10Mar 2, 2016Updated 10 years ago
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Large Language Models are zero-shot text classifiers; Smart Expert System: Large Language Models as Text Classifiers☆36May 30, 2024Updated last year
- 基于 BPE 实现的中文分词。优化:预处理,并行计算,多字词,多词表☆14May 14, 2022Updated 3 years ago
- 基于MFCC特征构建单核GMM的0-9独立词语音识别,MFCC,GMM,sklearn,Isolated word recognition。☆10Nov 18, 2020Updated 5 years ago
- 同济大学计科机器学习大作业☆10Mar 22, 2025Updated last year
- ☆14Apr 1, 2023Updated 3 years ago
- Official implementation of the paper "STARS: Self-supervised 3D Action Recognition with Contrastive Tuning".☆17Jan 6, 2025Updated last year
- hexo腾讯云COS一键部署工具hexo-deployer-qcloud-cos使用说明☆19Feb 27, 2022Updated 4 years ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated last year
- 这是一个可通过网页远程登录管理、可接入讯飞星火、ChatGPT等大语言模型的微信聊天机器人,使用微信网页版协议。☆16Feb 20, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“☆26Jun 24, 2024Updated last year
- Generate Game Character for animation (SSD)☆35Mar 16, 2025Updated last year
- Task-Optimized Adapters for an End-to-End Dialogue System Paper Code☆21Jul 31, 2023Updated 2 years ago
- 2024-2025下半学年人工智能导论(拔尖班)☆16Jun 16, 2025Updated 10 months ago
- A unified CLI tool for querying multiple search engines☆24Aug 24, 2025Updated 8 months ago
- python爬取股市数据,并对各个行业股票行情、财务数据进行重构分析☆11Jul 26, 2020Updated 5 years ago
- Reduce verbose SQL queries to minimal examples☆58Mar 25, 2026Updated last month
- ☆15May 1, 2025Updated last year
- ☆19Jul 7, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 基于微信原生语言+微信云开发+腾讯地图api 构建的智慧旅游导览小程序☆15Oct 25, 2024Updated last year
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆13Jun 1, 2025Updated 11 months ago
- Implementation of my CS336 assignment1☆43Dec 23, 2025Updated 4 months ago
- 这是一个大学生互联网+的大创项目:“一点到家”——云滇家政平台助力乡村振兴,系统前台:微信小程序,后端springboot,数据库mysql。属于一个非常值得推荐的项目,系统源码简单宜读,干净简洁、注释详细,可二次开发。创意满满,贴近生活,缓解就业压力,为农民增收致富,促进…☆14Jun 17, 2023Updated 2 years ago
- 「城语」APP基于A级景区、历史古迹、文物保护单位等基础数据,利用先进的大模型能力实现智能化的Citywalk 路线规划,包括设计一条路线、生成路线攻略、生成景点的推荐理由等三大核心功能;利用大模型减少了人工编辑和推荐的工作量,并可以根据游客的需求进行个性化定制,提升了游客…☆19Feb 20, 2024Updated 2 years ago
- Integrating Large Weather Models with Data Assimilation☆23Jun 2, 2024Updated last year
- 一款很棒的书摘软件 微信小程序 中山大学软件创新大赛十强参赛项目☆16May 3, 2018Updated 8 years ago