深入探索大型语言模型(LLM)的世界,本项目汇集了跨越五个关键维度的代表性文本数据集——预训练语料库、微调指令数据集、偏好数据集、评估数据集、传统NLP数据集及多模态数据集。我们致力于为研究者和开发者提供最全面的资源,以推动人工智能技术的发展和应用。
☆20Apr 26, 2024Updated last year
Alternatives and similar repositories for AwesomeLLMsDatasets
Users that are interested in AwesomeLLMsDatasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 2 months ago
- ☆11Jun 11, 2024Updated last year
- The code and data for the paper "Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation"☆13Oct 8, 2025Updated 6 months ago
- Trial version for prs platform (python project). Please note that the complete experience requires downloading the Unity resource.☆10Jun 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Build LLM Application with Local Documents☆19Jun 13, 2025Updated 10 months ago
- ☆14Aug 21, 2025Updated 7 months ago
- 校园音乐征集投票系统 A system for electing annual school music☆10Apr 10, 2026Updated last week
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening☆10Dec 18, 2025Updated 4 months ago
- Taurix OS kernel. Taurix 系统内核,操作系统原理实(xjb)践(写)☆12Dec 20, 2020Updated 5 years ago
- An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- Physics-Informed deep LSTM architecture to forecast Lorenz and MFE fluid systems☆14May 18, 2020Updated 5 years ago
- 使用Sentencepiece对中文语料进行分词☆13Nov 30, 2023Updated 2 years ago
- Physcial Informed Extreme Learning Machine(PIELM) method to solve PDEs, such as Possion problem☆19Dec 6, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 基于youtube、bilibili等视频平台、webpage网页等,利用零一万物大模型或ollama本地小模型构建大语言模型高质量训练数据集(计划支持可自定义输出的训练数据格式)☆19May 2, 2024Updated last year
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- 基于MFCC特征构建单核GMM的0-9独立词语音识别,MFCC,GMM,sklearn,Isolated word recognition。☆10Nov 18, 2020Updated 5 years ago
- 同济大学计科机器学习大作业☆10Mar 22, 2025Updated last year
- ☆32Jan 9, 2026Updated 3 months ago
- ☆14Apr 1, 2023Updated 3 years ago
- hexo腾讯云COS一键部署工具hexo-deployer-qcloud-cos使用说明☆19Feb 27, 2022Updated 4 years ago
- 这是一个可通过网页远程登录管理、可接入讯飞星火、ChatGPT等大语言模型的微信聊天机器人,使用微信网页版协议。☆16Feb 20, 2024Updated 2 years ago
- ☆13Apr 7, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 监控哔哩哔哩直播间数据,实时保存至数据库,并在内置网页上查看精致的可视化统计图表。☆13Jan 4, 2022Updated 4 years ago
- Training neural networks for inverse design of nanophotonic gratings.☆21Dec 15, 2021Updated 4 years ago
- Task-Optimized Adapters for an End-to-End Dialogue System Paper Code☆21Jul 31, 2023Updated 2 years ago
- Implementation of my CS336 assignment1☆43Dec 23, 2025Updated 3 months ago
- 这是一个大学生互联网+的大创项目:“一点到家”——云滇家政平台助力乡村振兴,系统前台:微信小程序,后端springboot,数据库mysql。属于一个非常值得推荐的项目,系统源码简单宜读,干净简洁、注释详细,可二次开发。创意满满,贴近生活,缓解就业压力,为农民增收致富,促进…☆14Jun 17, 2023Updated 2 years ago
- 「城语」APP基于A级景区、历史古迹、文物保护单位等基础数据,利用先进的大模型能力实现智能化的Citywalk 路线规划,包括设计一条路线、生成路线攻略、生成景点的推荐理由等三大核心功能;利用大模型减少了人工编辑和推荐的工作量,并可以根据游客的需求进行个性化定制,提升了游客…☆19Feb 20, 2024Updated 2 years ago
- Integrating Large Weather Models with Data Assimilation☆23Jun 2, 2024Updated last year
- Coupled Generalized Nonlinear Schrodringer Equation solver for birefringent fibers☆21Aug 30, 2022Updated 3 years ago
- 一款很棒的书摘软件 微信小程序 中山大学软件创新大赛十强参赛项目☆16May 3, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 大学整理项目一:一个旅游踩点项目,踩点即对一个个事先有记录的有意思的旅行停驻点进行拜访游玩并留下你的足,这些停驻点我们称之为关注点。在该系统中还可以自己规划行程,事先计划好要前往的关注点 ,路线然后按照系统上的路线规划进行旅游,在旅游中可以写一些文字,发一些图片,整个行程完…☆10Apr 27, 2018Updated 7 years ago
- LLE soliver for python☆16Oct 20, 2024Updated last year
- Lugiato-Lefever Equation Solver☆26Mar 24, 2024Updated 2 years ago
- Port of Andrej Karpathy's minbpe to Rust☆31May 6, 2024Updated last year
- 易用且适用于所有模型的live2d-web库, 手把手教做人(☆30Jan 29, 2026Updated 2 months ago
- BackTime: Backdoor Attacks on Multivariate Time Series Forecasting☆31Apr 14, 2025Updated last year
- 🏔️ PINNACLE: PINN Adaptive ColLocation and Experimental points selection☆28Jul 26, 2024Updated last year