chenzomi12 / DeepLearningSystemLinks
AI Infra主要是指AI的基础建设,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术。
☆259Updated last year
Alternatives and similar repositories for DeepLearningSystem
Users that are interested in DeepLearningSystem are comparing it to the libraries listed below
Sorting:
- Draft-Target Disaggregation LLM Serving System via Parallel Speculative Decoding.☆158Updated last week
- A high-performance inference engine for LLMs, optimized for diverse AI accelerators.☆988Updated last week
- 金融财报问答大模型LLM☆220Updated last year
- Deep Research☆303Updated 5 months ago
- [ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers☆141Updated 2 years ago
- [EMNLP 2025] RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions☆136Updated 9 months ago
- Github开源项目精选栏目,不定期更新☆236Updated 2 years ago
- ☆624Updated last year
- A single RPC framework based on Go☆19Updated 2 years ago
- Go Out出海第一步,搞定工具库 独立开发者出海技术栈和工具, 收集的一些有用的出海工具和资源,可以帮助你更好地了解和进入海外市场。 挑选标准 帮助独立开发者提升开发效率 帮助独立开发者降低成本 市场上足够流行☆22Updated last year
- VisionForge是一个轻量级、高扩展性的大模型图片训练&描述工具生成器,支持多家大模型API(Google、OpenAI 兼容、DeepSeek、Qwen、GLM、Claude、Doubao、自定义模型)。 它提供多图片上传、提示词优化、自动生成JSONL训练数据、多…☆46Updated 2 weeks ago
- Collection of projects / apps integrated with dify service API.☆19Updated last year
- 提高微信二维码识别精确率的小工具☆102Updated 9 months ago
- Trainable fast and memory-efficient sparse attention☆526Updated this week
- ☆118Updated 3 months ago
- ☆26Updated 2 months ago
- C++企业微信消息推送服务器☆38Updated 2 years ago
- Learn how to develop kernels☆107Updated last month
- [ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference☆200Updated last year
- ☆523Updated 2 weeks ago
- ☆70Updated 6 months ago
- A unified, efficient, and extensible PyTorch-based recommendation library☆113Updated last week
- 一种任务级GPU算力分时调度的高性能深度学习训练平台☆733Updated 2 years ago
- ☆325Updated 7 months ago
- ☆64Updated last week
- 整理了各大厂的 GitHub 地址及热门开源项目,帮助大家更高效地了解国产开源生态☆113Updated 7 months ago
- OK Computer in a Box: Your Self-Hosted Agent Workflow Layer☆132Updated last month
- A third-party React-based web back-end service for XXL-JOB-PANEL-R3.☆99Updated 5 months ago
- ☆43Updated last week
- [NeurIPS 2025 spotlight] QFFT, Question-Free Fine-Tuning for Adaptive Reasoning☆91Updated 3 months ago