ArtificialZeng / llama3_explained
the newest version of llama3,source code explained line by line using Chinese
☆22Updated 9 months ago
Alternatives and similar repositories for llama3_explained:
Users that are interested in llama3_explained are comparing it to the libraries listed below
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated 10 months ago
- Imitate OpenAI with Local Models☆85Updated 5 months ago
- 大语言模型训练和服务调研☆35Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆55Updated 9 months ago
- ☆25Updated 4 months ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆55Updated last month
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co…☆55Updated last month
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆59Updated 6 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆64Updated last year
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆54Updated last year
- moss chat finetuning☆50Updated 9 months ago
- TianGong-AI-Unstructure☆57Updated 3 weeks ago
- ☆32Updated last month
- ☆34Updated last month
- Here is a demo for PDF parser (Including OCR, object detection tools)☆32Updated 4 months ago
- 多轮共情对话模型PICA☆87Updated last year
- 中文原生检索增强生成测评基准☆108Updated 9 months ago
- LLM+RAG for QA☆21Updated last year
- 通用简单工具项目☆15Updated 4 months ago
- ☆45Updated 8 months ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆44Updated last month
- 大型语言模型实战指南:应用实践与场景落地☆55Updated 5 months ago
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆85Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆18Updated last year
- ☆36Updated 5 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 11 months ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆42Updated last month
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆69Updated 5 months ago
- SUS-Chat: Instruction tuning done right☆48Updated last year