ArtificialZeng / transformers-Explained
官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for transformers-Explained
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆25Updated last year
- Large-scale exact string matching tool☆15Updated this week
- 中文大语言模型评测第三期☆24Updated 5 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 6 months ago
- ☆51Updated last month
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆50Updated 3 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Updated 7 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆26Updated 5 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆70Updated last year
- Here is a demo for PDF parser (Including OCR, object detection tools)☆31Updated last month
- ☆12Updated 3 weeks ago
- 百度QA100万数据集☆49Updated 11 months ago
- LLM+RAG for QA☆19Updated 9 months ago
- 中文原生检索增强生成测评基准☆99Updated 6 months ago
- ☆34Updated 2 months ago
- 大语言模型训练和服务调研☆34Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval☆11Updated 5 months ago
- aigc evals☆10Updated 11 months ago
- Python client designed specifically for large-scale requests to the openai interface☆21Updated 8 months ago
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆51Updated last year
- Enhancing Retrieval and Managing Retrieval: 4-Module Synergy☆15Updated last week
- ☆13Updated 11 months ago
- this repo is mnbvc text quality classification using fastText☆14Updated last year
- 通用简单工具项目☆13Updated last month
- GoGPT中文指令数据集构造☆10Updated 9 months ago
- make LLM easier to use☆58Updated last year
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆26Updated 3 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆59Updated last month