schinger / FullLLM
Full stack LLM (Pre-training/finetuning, PPO(RLHF), Inference, Quant, etc.)
☆22Updated 4 months ago
Alternatives and similar repositories for FullLLM
Users who are interested in FullLLM are comparing it to the libraries listed below.
- Custom reward development on verl☆56Updated last month
- Fine-tuning large language models with the DPO algorithm; simple and easy to get started.☆39Updated 11 months ago
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆40Updated last month
- ☆44Updated 4 months ago
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆63Updated 4 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continual pre-training to improve …☆33Updated 3 weeks ago
- ☆141Updated last year
- ☆82Updated last year
- Reinforcement Learning in LLM and NLP.☆39Updated last week
- NTK scaled version of ALiBi position encoding in Transformer.☆68Updated last year
- Fine-tuning of models such as llama and chatglm☆89Updated 11 months ago
- PyTorch distributed training☆67Updated last year
- A repo showcasing the use of MCTS with LLMs to solve gsm8k problems☆84Updated 3 months ago
- ☆70Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆81Updated 7 months ago
- Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...☆73Updated last month
- ☆19Updated 2 months ago
- Tracks the latest developments in AI job positions such as NLP, CV, search, and recommendation.☆29Updated 2 years ago
- ☆16Updated last year
- How to train an LLM tokenizer☆150Updated last year
- ☆142Updated 11 months ago
- ☆22Updated last year
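- Train an LLM from scratch on a single 24 GB GPU☆56Updated last month
- 揣摩研习社 follows cutting-edge natural language and information retrieval technology, interprets popular scientific papers, shares practical research tools, and uncovers the academic and applied value beneath the tip of the AI iceberg!☆37Updated 2 years ago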
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Updated last year
- A framework for editing the CoTs for better factuality☆50Updated last year
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)☆80Updated last year
- ☆44Updated 10 months ago