wei-potato / Train-llm-from-scratch
使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力
☆141Updated 7 months ago
Alternatives and similar repositories for Train-llm-from-scratch:
Users that are interested in Train-llm-from-scratch are comparing it to the libraries listed below
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆45Updated last year
- Support mixed-precsion inference with vllm☆80Updated last month
- This is a repo for my NanoGPT Pytorch2.0 Implementation when torch2.0 released soon, faster and simpler, a good tutorial learning GPT.☆52Updated last year
- Mixed precision inference by Tensorrt-LLM☆76Updated 3 months ago
- The framework to prune LLMs to any size and any config.☆87Updated 11 months ago
- Emotion text classification using Llama3-8b with LoRA and FlashAttention. Based on LLaMA-Factory.☆59Updated 6 months ago
- MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…☆172Updated 3 months ago
- 本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。☆44Updated 11 months ago
- 从预训练到强化学习的中文llama2☆86Updated last year
- Chinese large language model☆117Updated last year
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆58Updated 4 months ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆158Updated 3 months ago
- ☆47Updated 7 months ago
- A collection of papers related to knowledge fusion☆54Updated 4 months ago
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆214Updated 4 months ago
- LLM Benchmark for Code☆31Updated 6 months ago
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆169Updated 3 months ago
- MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction☆78Updated 3 months ago
- 保险行业回访外呼机器人☆62Updated last year
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆93Updated last year
- Code and data for crosstalk text generation tasks, exploring whether large models and pre-trained language models can understand humor. …☆144Updated 2 years ago
- 教你只用最基本的python语法和numpy一步步实现深度学习框架☆98Updated 6 months ago
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour…☆151Updated last year
- [ACL2024 Findings] Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM☆56Updated last month
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response☆40Updated 2 months ago
- Multilingual Corpus of Web Fiction☆189Updated 7 months ago
- [ACL2024 Findings] Towards Better Question Generation in QA-Based Event Extraction☆43Updated last month
- 【高性能OpenAI LLM服务】通过GPRS+TensorRT-LLM+Tokenizers.cpp实现纯C++版高性能OpenAI LLM服务,支持chat和function call模式,支持ai agent,支持分布式多卡推理,支持多模态,支持gradio聊天界面。☆95Updated last week
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆155Updated 2 months ago
- Collecting personality-indicative data for role-playing agents.☆22Updated this week