ShaohonChen / transformers_from_scratch
Pretrain a wiki LLM using transformers
☆10Updated 2 months ago
Related projects
Alternatives and complementary repositories for transformers_from_scratch
- ☆15Updated this week
- Recursive Abstractive Processing for Tree-Organized Retrieval☆11Updated 5 months ago
- 🎓Automatically update agent papers daily using GitHub Actions (updates every 12 hours)☆19Updated this week
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated 7 months ago
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆11Updated 9 months ago
- The newest version of llama3, with source code explained line by line in Chinese☆22Updated 7 months ago
- SUS-Chat: Instruction tuning done right☆47Updated 10 months ago
- Story scenario generation, emotional scenario guidance, plot summarization, and personality analysis implemented with langchain☆14Updated 5 months ago
- A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.☆17Updated 2 weeks ago
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆50Updated 4 months ago
- Accelerate vector generation by using an ONNX model☆12Updated 10 months ago
- Best practices for retrieval-augmented generation with large models.☆47Updated 2 months ago
- ☆105Updated last year
- Music large model based on InternLM2-chat.☆21Updated 4 months ago
- A survey of large language model training and serving☆34Updated last year
- Imitate OpenAI with Local Models☆85Updated 2 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆33Updated this week
- A simple way to synthesize LLM training data. (under construction⚠)☆10Updated last week
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆52Updated 7 months ago
- A practical guide to large language models: application practice and real-world deployment☆37Updated 2 months ago
- Exploring the application potential of LLMs in the legal industry☆27Updated this week
- ☆92Updated 6 months ago
- TianGong-AI-Unstructure☆51Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆124Updated 11 months ago
- An LLM RAG application with support for API calls and voice interaction.☆10Updated 4 months ago
- An AI agent for real-time trending-topic and public-opinion analysis on the BiliBili website☆17Updated this week
- ☆13Updated last month
- ☆21Updated last year
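- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆56Updated 2 months ago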