One codebase for instruction fine-tuning large models
☆39Aug 1, 2023Updated 2 years ago
Alternatives and similar repositories for LLMs_train
Users that are interested in LLMs_train are comparing it to the libraries listed below.
- Solution for the Foursquare - Location Matching competition☆14Jul 8, 2022Updated 3 years ago
- Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories (CRMU)☆18Oct 22, 2024Updated last year
- A simple PyTorch implementation of vector representation methods for Chinese text (Sentence-BERT, CoSENT), usable for text similarity computation.☆10Mar 27, 2022Updated 4 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- Exploration of semantic chunking and chunk classification☆19Sep 16, 2024Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Demo for the AIOPS24 challenge☆64Jun 21, 2024Updated last year
- Sparse Multilabel Categorical Crossentropy☆11Sep 10, 2023Updated 2 years ago
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- 31st place silver medal solution to USPPPM Kaggle competition☆20Jun 23, 2022Updated 3 years ago
- ai4code competition source code☆19Aug 12, 2022Updated 3 years ago
- ☆11Jan 19, 2025Updated last year
- Fine-tuning large models with instruction tuning.☆11Jun 28, 2023Updated 2 years ago
- The largest open-source Chinese Q&A dataset, supporting Chinese LLMs.☆10Jul 31, 2023Updated 2 years ago
- Train a tiny LLaMA model from scratch to repeat your words using Reinforcement Learning from Human Feedback (RLHF)☆18May 23, 2024Updated last year
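- 2023 Global Intelligent Vehicle AI Challenge, Track 1: retrieval question answering with large AI models, 75+ baseline☆61Dec 7, 2023Updated 2 years ago
- PyTorch classification network: training, testing, and model conversion in Python && C++ deployment with LibTorch on Windows☆19Sep 16, 2021Updated 4 years ago
- llama3 inference implemented from scratch using numpy and wrapped for reuse, with a comparison of GPU and CPU performance and LoRA fine-tuning.☆12Jul 16, 2024Updated last year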
- ☆14Apr 19, 2022Updated 3 years ago
- ☆24Feb 15, 2022Updated 4 years ago
- GAIIC Track 1: Radiology NLP, medical imaging diagnostic report generation [Rank-12 solution by team A100换你大棚甜瓜]☆68Jun 9, 2023Updated 2 years ago
- ☆17Jun 30, 2021Updated 4 years ago
- ☆12Jun 19, 2025Updated 9 months ago
- User-profile-based product recommendation challenge, Rank 5☆25Sep 22, 2021Updated 4 years ago
- Code and data for the paper "Dual Dynamic Memory Network for End-to-End Multi-turn Task-oriented Dialog Systems".☆14Aug 16, 2022Updated 3 years ago
- Recommended system algorithm implementation☆10Feb 18, 2020Updated 6 years ago
- Target-dependent Sentiment Classification with BERT☆14Aug 24, 2023Updated 2 years ago
- CCF-BDCI few-shot data classification task☆17Jan 13, 2023Updated 3 years ago
- 灵枢量化 | NexusQuant - AI-driven multi-strategy, multi-timeframe cryptocurrency trading monitoring system☆32Jan 13, 2026Updated 2 months ago
- DCIC2023 Fraud Risk Identification Competition Solution.☆26Mar 30, 2023Updated 3 years ago
- Dataset for 'Learning End-to-End Goal-Oriented Dialog with Multiple Answers' EMNLP 2018☆18Nov 16, 2020Updated 5 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- ☆23Apr 21, 2023Updated 2 years ago
- Simple and efficient multi-GPU fine-tuning of large models with deepspeed+trainer☆133May 27, 2023Updated 2 years ago
- for DTCA model☆10Oct 17, 2023Updated 2 years ago
- Source code for "Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems"☆10Oct 5, 2020Updated 5 years ago
- "Man's relationship with technology is complex. We always invent technology, but then technology comes back and reinvents us." ― Atul Jal…☆25Dec 18, 2019Updated 6 years ago
- helper code for kaggle handm 2022 recommendation competition☆13May 16, 2022Updated 3 years ago