ziwang-com / AGMLinks
AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。
☆29Updated 2 years ago
Alternatives and similar repositories for AGM
Users that are interested in AGM are comparing it to the libraries listed below
Sorting:
- zero零训练llm调参☆32Updated 2 years ago
- ☆106Updated last year
- AGI模块库架构图☆76Updated 2 years ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- aigc evals☆10Updated last year
- 全球首个StableVicuna中文优化版。☆63Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69Updated 2 years ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆141Updated last year
- 大语言模型训练和服务调研☆36Updated 2 years ago
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated this week
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆40Updated last year
- share data, prompt data , pretraining data☆36Updated last year
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated 2 years ago
- ☆15Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆43Updated 6 months ago
- A light proxy solution for HuggingFace hub.☆47Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- Another ChatGLM2 implementation for GPTQ quantization☆55Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Updated last year
- 基于baichuan-7b的开源多模态大语言模型☆72Updated last year
- Its an open source LLM based on MOE Structure.☆58Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆70Updated last year
- 一站式自动化开源标注平台☆77Updated 3 years ago
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆17Updated last year
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆54Updated last year
- 中文原生检索增强生成测评基准☆121Updated last year
- ☆94Updated 8 months ago
- Here is a demo for PDF parser (Including OCR, object detection tools)☆35Updated 10 months ago
- Open efforts to implement ChatGPT-like models and beyond.☆109Updated last year
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然 语言交互)☆91Updated 2 years ago