llm-factory / LLaMA-Factory-DocLinks
LLaMA Factory Document
☆135Updated 2 weeks ago
Alternatives and similar repositories for LLaMA-Factory-Doc
Users that are interested in LLaMA-Factory-Doc are comparing it to the libraries listed below
Sorting:
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆216Updated 4 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆244Updated 7 months ago
- ☆169Updated last year
- Imitate OpenAI with Local Models☆86Updated 10 months ago
- 怎么训练一个LLM分词器☆150Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆251Updated 3 weeks ago
- ☆229Updated last year
- ☆142Updated 11 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆145Updated 3 months ago
- ☆141Updated last year
- a-m-team's exploration in large language modeling☆161Updated 3 weeks ago
- ☆222Updated last year
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆189Updated 3 weeks ago
- ☆146Updated last year
- ☆109Updated 7 months ago
- Mixture-of-Experts (MoE) Language Model☆189Updated 9 months ago
- ☆152Updated last month
- Alpaca Chinese Dataset -- 中文指令微调数据集☆208Updated 8 months ago
- ☆178Updated 2 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆250Updated 6 months ago
- Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥☆262Updated 5 months ago
- 使用单个24G显卡,从0开始训练LLM☆56Updated last month
- An automated pipeline for evaluating LLMs for role-playing.☆187Updated 9 months ago
- 大模型多维度中文对齐评测基准 (ACL 2024)☆394Updated 10 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆373Updated 9 months ago
- The related works and background techniques about Openai o1☆222Updated 5 months ago
- ☆145Updated 5 months ago
- ☆241Updated 2 weeks ago
- 中文基于满血DeepSeek-R1蒸馏数据集☆56Updated 4 months ago
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆89Updated last year