深入探索大型语言模型(LLM)的世界,本项目汇集了跨越五个关键维度的代表性文本数据集——预训练语料库、微调指令数据集、偏好数据集、评估数据集、传统NLP数据集及多模态数据集。我们致力于为研究者和开发者提供最全面的资源,以推动人工智能技术的发展和应用。
☆20Apr 26, 2024Updated 2 years ago
Alternatives and similar repositories for AwesomeLLMsDatasets
Users that are interested in AwesomeLLMsDatasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The latest progress of Personalized Large Language Models (LLMs).☆46May 18, 2026Updated last week
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- Data for evaluating GPT-4V☆11Oct 26, 2023Updated 2 years ago
- ☆11Jun 11, 2024Updated last year
- The code and data for the paper "Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation"☆13Oct 8, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Build LLM Application with Local Documents☆19Jun 13, 2025Updated 11 months ago
- ☆10Nov 17, 2022Updated 3 years ago
- ☆17Feb 6, 2025Updated last year
- ☆14Aug 21, 2025Updated 9 months ago
- chatgpt库的调用, 支持流式和非流式,可以实现类似官方chatgpt的显示效果☆12Jan 30, 2024Updated 2 years ago
- 校园音乐征集投票系统 A system for electing annual school music☆10May 22, 2026Updated last week
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening☆10Dec 18, 2025Updated 5 months ago
- Taurix OS kernel. Taurix 系统内核,操作系统原理实(xjb)践(写)☆12Dec 20, 2020Updated 5 years ago
- 量化交易网站,软工三大作业迭代三,团队项目☆11Mar 8, 2018Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- NewsApp包含客户端源码、服务端源码、数据库文件。 基于Miscrosoft人工智能项目ProjectOxford中的Recognition Emotion做的, 主要是基于用户的面部表情来推送不同类别的新闻。 Emotion API可以参考:https://www.p…☆10Mar 2, 2016Updated 10 years ago
- ☆19Apr 19, 2024Updated 2 years ago
- Physics-Informed deep LSTM architecture to forecast Lorenz and MFE fluid systems☆14May 18, 2020Updated 6 years ago
- 使用Sentencepiece对中文语料进行分词☆13Nov 30, 2023Updated 2 years ago
- Physcial Informed Extreme Learning Machine(PIELM) method to solve PDEs, such as Possion problem☆18Dec 6, 2024Updated last year
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- Library of some notes☆10May 29, 2023Updated 3 years ago
- 同济大学计科机器学习大作业☆10Mar 22, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning☆20Oct 13, 2025Updated 7 months ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated last year
- 这是一个可通过网页远程登录管理、可接入讯飞星火、ChatGPT等大语言模型的微信聊天机器人,使用微信网页版协议。☆16Feb 20, 2024Updated 2 years ago
- Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos☆11Apr 26, 2026Updated last month
- 监控哔哩哔哩直播间数据,实时保存至数据库,并在内置网页上查看精致的可视化统计图表。☆13Jan 4, 2022Updated 4 years ago
- ☆13Apr 7, 2022Updated 4 years ago
- 2024-2025下半学年人工智能导论(拔尖班)☆16Jun 16, 2025Updated 11 months ago
- python爬取股市数 据,并对各个行业股票行情、财务数据进行重构分析☆11Jul 26, 2020Updated 5 years ago
- ☆15May 1, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆13Jun 1, 2025Updated 11 months ago
- Explore Apple Health data using LLMs☆22Jan 26, 2025Updated last year
- Um site dinâmico desenvolvido em Next.js, utilizando tecnologias como Tailwind CSS e diversas bibliotecas de UI do Radix UI e Shadcn UI. …☆20Sep 24, 2024Updated last year
- 这是一个大学生互联网+的大创项目:“一点到家”——云滇家政平台助力乡村振兴,系统前台:微信小程序,后端springboot,数据库mysql。属于一个非常值得推荐的项目,系统源码简单宜读,干净简洁、注释详细,可二次开发。创意满满,贴近生活,缓解就业压力,为农民增收致富,促进…☆14Jun 17, 2023Updated 2 years ago
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub…☆19Sep 6, 2023Updated 2 years ago
- 「城语 」APP基于A级景区、历史古迹、文物保护单位等基础数据,利用先进的大模型能力实现智能化的Citywalk 路线规划,包括设计一条路线、生成路线攻略、生成景点的推荐理由等三大核心功能;利用大模型减少了人工编辑和推荐的工作量,并可以根据游客的需求进行个性化定制,提升了游客…☆19Feb 20, 2024Updated 2 years ago
- 一款很棒的书摘软件 微信小程序 中山大学软件创新大赛十强参赛项目☆16May 3, 2018Updated 8 years ago