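Dive deep into the world of large language models (LLMs). This project gathers representative text datasets spanning five key dimensions: pre-training corpora, fine-tuning instruction datasets, preference datasets, evaluation datasets, traditional NLP datasets, and multimodal datasets. We aim to give researchers and developers the most comprehensive resources to advance the development and application of AI technology.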
☆20Apr 26, 2024Updated 2 years ago
Alternatives and similar repositories for AwesomeLLMsDatasets
Users that are interested in AwesomeLLMsDatasets are comparing it to the libraries listed below.
- Build LLM Application with Local Documents☆19Jun 13, 2025Updated 11 months ago
- Campus music submission and voting system. A system for electing annual school music☆10May 22, 2026Updated last week
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening☆10Dec 18, 2025Updated 5 months ago
- Taurix OS kernel. The Taurix system kernel, a hands-on (and rather haphazardly written) practice project for operating-system principles☆12Dec 20, 2020Updated 5 years ago
- Quantitative trading website; iteration 3 of the Software Engineering III course project, a team project☆11Mar 8, 2018Updated 8 years ago
- An Efficient BPE Algorithm Faster than Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆10Mar 13, 2023Updated 3 years ago
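- NewsApp includes client source code, server source code, and database files. Built on Recognition Emotion from Microsoft's AI project ProjectOxford, it pushes different categories of news based mainly on the user's facial expression. For the Emotion API, see: https://www.p…☆10Mar 2, 2016Updated 10 years ago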
- Tokenizing Chinese corpora with Sentencepiece☆13Nov 30, 2023Updated 2 years ago
- Physics-Informed Extreme Learning Machine (PIELM) method to solve PDEs, such as the Poisson problem☆18Dec 6, 2024Updated last year
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- Isolated-word speech recognition for the digits 0-9 using single-component GMMs built on MFCC features. MFCC, GMM, sklearn, isolated word recognition.☆10Nov 18, 2020Updated 5 years ago
- Official code of the MSF model for GZSSAR (ICIG 2023)☆13Jan 3, 2026Updated 4 months ago
- Tongji University computer science machine learning course project☆10Mar 22, 2025Updated last year
- AU-Expression Knowledge Constrained Representation Learning for Facial Expression Recognition (ICRA 2021)☆11Dec 29, 2023Updated 2 years ago
- ☆14Apr 1, 2023Updated 3 years ago
- Official implementation of the paper "STARS: Self-supervised 3D Action Recognition with Contrastive Tuning".☆17Jan 6, 2025Updated last year
- ☆13Apr 7, 2022Updated 4 years ago
- Scrape stock market data with Python, then restructure and analyze stock quotes and financial data across industries☆11Jul 26, 2020Updated 5 years ago
- ☆19Jul 7, 2024Updated last year
- Python package for temporal evolution of initial conditions under the generalized Lugiato-Lefever equation☆19Sep 22, 2022Updated 3 years ago
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub…☆19Sep 6, 2023Updated 2 years ago
- Aurora forecasts created from solar wind data (OVATION Prime 2010)☆20Apr 11, 2025Updated last year
- Integrating Large Weather Models with Data Assimilation☆24Jun 2, 2024Updated last year
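- University project roundup, no. 1: a travel check-in project. Checking in means visiting interesting, previously recorded travel stops and leaving your footprint there; we call these stops points of interest. In the system you can also plan your own itinerary, deciding in advance which points of interest and route to take, then travel according to the system's route plan. During the trip you can write some text and post some pictures; the whole itinerary…☆10Apr 27, 2018Updated 8 years ago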
- Implementation of my CS336 assignment1☆44Dec 23, 2025Updated 5 months ago
- Lugiato-Lefever Equation Solver☆27Mar 24, 2024Updated 2 years ago
- Port of Andrej Karpathy's minbpe to Rust☆31May 6, 2024Updated 2 years ago
- ☆16May 14, 2024Updated 2 years ago
- Web one-click mode full process platform, including train data upload, fine-tuning, model merge, model deploy, gpu monitor etc., no need …☆19Nov 28, 2023Updated 2 years ago
- Official repository for Physics Informed Token Transformer (PITT)☆25Feb 24, 2024Updated 2 years ago
- Coupled Generalized Nonlinear Schrödinger Equation solver for birefringent fibers☆20Aug 30, 2022Updated 3 years ago
- BackTime: Backdoor Attacks on Multivariate Time Series Forecasting☆31Apr 14, 2025Updated last year
- This is the official implementation for SkeleMixCLR☆18Jul 8, 2022Updated 3 years ago
- Official PyTorch implementation of the paper "Hyperbolic Self-paced Learning for Self-supervised Skeleton-based Action Representations" (…☆22Jun 21, 2024Updated last year
- replace the current round robin scheduler in xv6 with a lottery scheduler☆14Oct 19, 2019Updated 6 years ago
- ☆19Oct 19, 2024Updated last year
- A machine learning boosted parallel-in-time differential equation solver framework.☆27May 29, 2023Updated 3 years ago
- ☆22Nov 21, 2021Updated 4 years ago