wei-potato / Train-llm-from-scratch
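Train an LLM from scratch with DeepSpeed, through the pretraining and SFT stages, to verify the LLM's ability to learn knowledge, understand language, and answer questions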
☆147Updated 8 months ago
Alternatives and similar repositories for Train-llm-from-scratch:
Users that are interested in Train-llm-from-scratch are comparing it to the libraries listed below
- ☆116Updated 2 weeks ago
- DeepRetrieval - Hacking 🔥Real Search Engines and Text/Data Retrievers with LLM + RL☆196Updated this week
- Deploy Qwen1.5 with the vLLM framework and stream its output☆55Updated 11 months ago
- Chinese Llama 2, from pretraining to reinforcement learning☆88Updated last year
- ☆38Updated this week
- Emotion text classification using Llama3-8b with LoRA and FlashAttention. Based on LLaMA-Factory.☆64Updated 7 months ago
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆45Updated last year
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework☆170Updated this week
- A repo for my NanoGPT implementation in PyTorch 2.0, written around the torch 2.0 release: faster and simpler, and a good tutorial for learning GPT.☆52Updated last year
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆160Updated 4 months ago
- Support mixed-precision inference with vllm☆80Updated 2 months ago
- This project builds on representative prior research to evaluate SFT data along multiple dimensions and to filter SFT data automatically.☆44Updated last year
- MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…☆174Updated this week
- The framework to prune LLMs to any size and any config.☆89Updated last year
- Mixed-precision inference with TensorRT-LLM☆77Updated 5 months ago
- Chinese large language model☆117Updated last year
- [ACL2024 Findings] Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM☆56Updated 3 weeks ago
- Teaches you to implement a deep learning framework step by step, using only basic Python syntax and NumPy☆128Updated 7 months ago
- A collection of papers related to knowledge fusion☆54Updated 5 months ago
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆60Updated 5 months ago
- ☆47Updated 8 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆161Updated 3 months ago
- Multilingual Corpus of Web Fiction☆191Updated 8 months ago
- adds Sequence Parallelism into LLaMA-Factory☆432Updated this week
- Code and data for crosstalk text generation tasks, exploring whether large models and pre-trained language models can understand humor. …☆144Updated 2 years ago
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆171Updated 4 months ago
- Codebase for Iterative DPO Using Rule-based Rewards☆227Updated last month
- CCKS 2021: "SGSum: A Human-Annotated Dataset for Sports Event Summarization"☆21Updated 3 years ago
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆24Updated last year
- Source code for ICLR2025 paper "NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation".☆71Updated 3 weeks ago