一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测,低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。
☆225Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for llms_tool
Users that are interested in llms_tool are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 百度UIE抽取模型torch版训练预测框架☆12Nov 20, 2024Updated last year
- 一个用于训练句子embedding的工具,支持Cosent以及Simcse、infonce☆23Jun 17, 2025Updated 11 months ago
- 意图槽位模型训练框架(单意图、独立槽位),使用的是JointBERT的结构进行编写,槽位提取时可以选择双指针(Binary Pointer)实体抽取和Global Pointer两种方法。☆21May 11, 2024Updated 2 years ago
- 一个使用tensorflow2.3开发的基于Bert的意图识别的自然语言理解任务Demo☆27May 11, 2024Updated 2 years ago
- 使用torch整合两种经典的指针NER抽取范式,分别是SpanBert和苏神的GlobalPointer,简单加了些tricks,配置后一键运行☆134May 29, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 常用的一些用于图像分类的模型demo。☆11Nov 22, 2022Updated 3 years ago
- 爬取百度贴吧、TapTap、appstore、微博官方博主上的游戏评论(基于redis_scrapy),过滤器采用了bloomfilter。☆54Nov 15, 2018Updated 7 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,642Oct 24, 2024Updated last year
- Tensorflow2.3的文本分类项目,支持各种分类模型,支持相关tricks。☆177Nov 28, 2024Updated last year
- 基于Tensorflow2.3开发的NER模型,都是CRF范式,包含Bilstm(IDCNN)-CRF、Bert-Bilstm(IDCNN)-CRF、Bert-CRF,可微调预训练模型,可对抗学习,用于命名实体识别,配置后可直接运行。☆432Dec 10, 2024Updated last year
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Mar 28, 2024Updated 2 years ago
- 一些 LLM 方面的从零复现笔记☆250Apr 29, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- share data, prompt data , pretraining data☆36Nov 30, 2023Updated 2 years ago
- ☆16May 31, 2024Updated last year
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆414Oct 21, 2023Updated 2 years ago
- 基于torch的,支持logits、crf、span、global_pointer四种方法的命名实体识别任务框架☆25Feb 29, 2024Updated 2 years ago
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆381Jul 21, 2024Updated last year
- ChatGLM-6B 指令学习|指令数据|Instruct☆652Apr 10, 2023Updated 3 years ago
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,015Apr 27, 2024Updated 2 years ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Aug 9, 2023Updated 2 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,282Oct 16, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The open source implementation of DeepSeek-R1. 开源复现 DeepSeek-R1☆277Mar 10, 2025Updated last year
- smart chinese LLm☆19Jan 31, 2024Updated 2 years ago
- 用于大模 型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆255Aug 1, 2023Updated 2 years ago
- Web one-click mode full process platform, including train data upload, fine-tuning, model merge, model deploy, gpu monitor etc., no need …☆19Nov 28, 2023Updated 2 years ago
- Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集☆3,052Apr 14, 2024Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Just a template for quickly creating a python library.☆10Updated this week
- Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调☆308May 3, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Open Multilingual Chatbot for Everyone☆1,273Jun 8, 2025Updated 11 months ago
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆359Aug 22, 2023Updated 2 years ago
- Agent HITL (Human-in-the-Loop) - 一个支持人机协作的AI代理系统,基于LangChain、LangGraph和FastAPI构建☆24Jun 2, 2025Updated 11 months ago
- ChatGLM-6B添加了RLHF的实现,以及部分核心代码的逐行讲解 ,实例部分是做了个新闻短标题的生成,以及指定context推荐的RLHF的实现☆88Aug 16, 2023Updated 2 years ago
- ☆239May 10, 2024Updated 2 years ago
- 🩹Editing large language models within 10 seconds⚡☆1,363Aug 13, 2023Updated 2 years ago
- ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…☆2,422Sep 29, 2023Updated 2 years ago