xinyuwei-david / david-shareLinks
☆308Updated this week
Alternatives and similar repositories for david-share
Users that are interested in david-share are comparing it to the libraries listed below
Sorting:
- A collection of RAG systems powered by LLM.☆188Updated 3 months ago
- LLM Inference benchmark☆421Updated 11 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆256Updated 3 weeks ago
- Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案☆521Updated 7 months ago
- AGI资料汇总学习(主要包括LLM和AIGC),持续更新......☆376Updated this week
- Community maintained hardware plugin for vLLM on Ascend☆791Updated this week
- 通义千问VLLM推理部署DEMO☆581Updated last year
- A pre-built agent for TableGPT2.☆589Updated 2 weeks ago
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.☆941Updated 2 weeks ago
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆500Updated 2 weeks ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆80Updated last month
- LLM101n: Let's build a Storyteller 中文版☆131Updated 10 months ago
- TinyRAG☆307Updated last week
- ☆133Updated 4 months ago
- LLM/MLOps/LLMOps☆94Updated last month
- An easy-to-use framework for modular RAG☆372Updated this week
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆55Updated 11 months ago
- recursive rag with r1 reasoning☆322Updated last month
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆203Updated last year
- 基于ReAct手搓一个Agent Demo☆136Updated last week
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆1,203Updated this week
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆246Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆135Updated 6 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆341Updated 2 months ago
- ☆109Updated 7 months ago
- LLM 推理服务性能测试☆42Updated last year
- ☆229Updated last year
- an intro to retrieval augmented large language model☆297Updated last year
- LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.☆782Updated this week
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。☆194Updated this week