FlyAIBox / dcu-in-action
Domestic (Chinese) accelerator cards: Hygon DCU in action (large-model training, fine-tuning, inference, etc.)
☆35Updated last week
Alternatives and similar repositories for dcu-in-action
Users that are interested in dcu-in-action are comparing it to the libraries listed below
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆85Updated last year
- An integrated user interface for use with HAI Platform☆52Updated 2 years ago
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework☆167Updated last month
- A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.☆25Updated 8 months ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆88Updated 2 months ago
- A framework for training large language models, supporting LoRA, full-parameter fine-tuning, etc.; define a YAML file to start training/fine-tuning of y…☆28Updated 10 months ago
- A simple, High-Performance, Scalable ML/DL Models Repository based on OCI Artifacts☆34Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 2 months ago
- ☆49Updated 4 months ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆29Updated 4 months ago
- GLM Series Edge Models☆146Updated last month
- vLLM Router☆39Updated last year
- A highly contextualized retrieval system integrating Large Language Models (LLMs), embeddings, and a dynamic agent-driven framework. Supp…☆24Updated 6 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆119Updated 2 months ago
- LLM inference service performance testing☆44Updated last year
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆61Updated 4 months ago
- OpsPilot is an open source intelligent operation and maintenance assistant based on deep learning and LLM technology developed by the WeO…☆187Updated 2 months ago
- Delta-CoMe achieves near-lossless 1-bit compression and has been accepted by NeurIPS 2024☆57Updated 8 months ago
- The CSGHub SDK is a powerful Python client specifically designed to interact seamlessly with the CSGHub server. This toolkit is engineere…☆17Updated this week
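- An analysis of the official transformers source code. In the era of large AI models, PyTorch and transformers are the new operating system; everything else is software running on top of them.☆17Updated last year
- An image search engine and management system built on multimodal vector models and vision multimodal models, supporting accurate text-to-text, text-to-image, and image-to-image search.☆50Updated last week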
- Implementing ReaRAG, a knowledge-guided reasoning model that enhances factual accuracy using iterative retrieval-augmented generation. Ad…☆14Updated 4 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆40Updated last year
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆205Updated last month
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆69Updated last year
- LLM API performance metrics comparison: an in-depth analysis of key metrics such as TTFT and TPS☆18Updated 10 months ago
- MCP-Zero: Active Tool Discovery for Autonomous LLM Agents☆244Updated last month
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆265Updated last week
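- "AI-Compass" guides the community through the ocean of AI technology. Whether you are a beginner or an advanced developer, you can find a path here to each major area of AI. It aims to help developers systematically understand AI's core concepts, mainstream technologies, and emerging trends, and to master the whole process from theory to deployment through practice.☆114Updated this week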