llm-factory / LLaMA-Factory-Doc
LLaMA Factory Document
☆121Updated last week
Alternatives and similar repositories for LLaMA-Factory-Doc:
Users that are interested in LLaMA-Factory-Doc are comparing it to the libraries listed below
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆240Updated 5 months ago
- ☆143Updated 9 months ago
- ☆107Updated 5 months ago
- Mixture-of-Experts (MoE) Language Model☆186Updated 7 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆249Updated 4 months ago
- 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability☆147Updated 2 weeks ago
- 中文原生检索增强生成测评基准☆115Updated last year
- ☆97Updated last year
- 怎么训练一个LLM分词器☆144Updated last year
- ☆157Updated 3 weeks ago
- Imitate OpenAI with Local Models☆88Updated 7 months ago
- ☆218Updated last year
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆84Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆78Updated 5 months ago
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆74Updated 7 months ago
- ☆146Updated last month
- ☆140Updated last year
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆187Updated 2 months ago
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆316Updated 9 months ago
- ☆226Updated 11 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆282Updated last week
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆361Updated 7 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆174Updated last week
- 大语言模型指令调优工具(支持 FlashAttention)☆172Updated last year
- 一些 LLM 方面的从零复现笔记☆185Updated last week
- WritingBench: A Comprehensive Benchmark for Generative Writing☆69Updated last week
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆124Updated 10 months ago
- ☆130Updated 3 months ago
- ☆108Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆135Updated 4 months ago