WALLE-AI / uReasoningLLMs
An introduction to reproducing DeepSeek-R1, with a collection of related resources
☆21Updated 5 months ago
Alternatives and similar repositories for uReasoningLLMs
Users that are interested in uReasoningLLMs are comparing it to the libraries listed below
- A survey of large language model training and serving☆37Updated 2 years ago
- Qwen1.5-SFT (Alibaba): Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers), LoRA (peft), and inference☆65Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
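- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆91Updated 10 months ago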
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- ☆27Updated last week
- The newest version of Llama 3, with source code explained line by line in Chinese☆22Updated last year
- ☆125Updated last year
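- First-place (top-1) solution to the Tianchi algorithm competition "BetterMixture - Large Model Data Mixing Challenge"☆31Updated last year
- PICA, a multi-turn empathetic dialogue model☆97Updated last year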
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆73Updated last month
- Train a mini Chinese large language model from scratch that can hold basic conversations; the model size depends on the hardware at hand☆61Updated 11 months ago
- LLM+RAG for QA☆22Updated last year
- As the name suggests: a hand-built RAG☆125Updated last year
- Train a LLaVA model with better Chinese support, and open-source the training code and data.☆64Updated 11 months ago
- Code for "An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought"☆16Updated last year
- ☆49Updated last year
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆22Updated 7 months ago
- How to train an LLM tokenizer☆151Updated 2 years ago
- The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"☆20Updated 7 months ago
- SuperCLUE-Math6: exploring a new-generation, native Chinese, multi-turn, multi-step mathematical reasoning dataset☆59Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- Fast instruction tuning with Llama2☆11Updated last year
- ☆20Updated last year
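- Fine-tuning based on ChatGLM2-6B, including full-parameter, parameter-efficient, and quantization-aware training; supports instruction fine-tuning, multi-turn dialogue fine-tuning, and more.☆27Updated 2 years ago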
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆120Updated this week
- Scaling Preference Data Curation via Human-AI Synergy☆95Updated last month
- Pretrain, decay, and SFT a CodeLLM from scratch 🧙‍♂️☆36Updated last year
- ☆30Updated 5 months ago