WalkerMitty / Fast-Llama2
Fast instruction tuning with Llama2
☆11Updated 11 months ago
Alternatives and similar repositories for Fast-Llama2:
Users that are interested in Fast-Llama2 are comparing it to the libraries listed below
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated 11 months ago
- OpenLLMDE: An open source data engineering framework for LLMs☆17Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated 11 months ago
- LLM+RAG for QA☆21Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆55Updated 10 months ago
- 大语言模型训练和服务调研☆37Updated last year
- 一套代码指令微调大模型☆38Updated last year
- KDD 2024 AQA competition 2nd place solution☆11Updated 8 months ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆65Updated 2 years ago
- Large-scale exact string matching tool☆15Updated 3 weeks ago
- 中文原生工业测评基准☆13Updated last year
- Code for "An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought"☆12Updated 8 months ago
- make LLM easier to use☆59Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 11 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆30Updated 10 months ago
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆27Updated 8 months ago
- ☆23Updated 5 months ago
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆74Updated 7 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆32Updated 3 months ago
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆54Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Updated last year
- ☆45Updated 9 months ago
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co…☆62Updated 3 weeks ago
- 使用单个24G显卡,从0开始训练LLM☆50Updated 5 months ago
- Music large model based on InternLM2-chat.☆22Updated 3 months ago
- ☆14Updated last year
- ThinkLLM:大语言模型算法与组件实现☆26Updated last week
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated last year