the-seeds / LLaMA-Factory-DocLinks
LLaMA Factory Document
☆152Updated 2 weeks ago
Alternatives and similar repositories for LLaMA-Factory-Doc
Users that are interested in LLaMA-Factory-Doc are comparing it to the libraries listed below
Sorting:
- OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…☆235Updated last month
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆251Updated last year
- An automated pipeline for evaluating LLMs for role-playing.☆201Updated last year
- a toolkit on knowledge distillation for large language models☆181Updated 2 weeks ago
- ☆115Updated 11 months ago
- WritingBench: A Comprehensive Benchmark for Generative Writing☆125Updated last month
- ☆233Updated last year
- Mixture-of-Experts (MoE) Language Model☆189Updated last year
- ☆147Updated last year
- ☆169Updated 6 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 3 months ago
- Imitate OpenAI with Local Models☆88Updated last year
- AN O1 REPLICATION FOR CODING☆336Updated 10 months ago
- ☆299Updated 5 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆160Updated 7 months ago
- ☆179Updated last year
- a-m-team's exploration in large language modeling☆190Updated 5 months ago
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆84Updated last year
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆379Updated this week
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆218Updated 3 months ago
- ☆49Updated last year
- ☆234Updated last year
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆293Updated 4 months ago
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆93Updated last year
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆103Updated 4 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆257Updated 10 months ago
- ☆40Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆45Updated last year
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆261Updated 8 months ago
- 怎么训练一个LLM分词器☆153Updated 2 years ago