chenzomi12 / DeepLearningSystem
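AI Infra mainly refers to AI infrastructure, including AI chips, AI compilers, AI inference and training frameworks, and other low-level technologies across the full AI stack.
☆247 Updated last year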
Alternatives and similar repositories for DeepLearningSystem
Users that are interested in DeepLearningSystem are comparing it to the libraries listed below
- A high-performance inference engine for LLMs, optimized for diverse AI accelerators. ☆518 Updated last week
- An LLM for question answering over financial reports ☆216 Updated last year
- [ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers ☆132 Updated last year
- [EMNLP 2025] RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions ☆131 Updated 5 months ago
- Collection of projects / apps integrated with dify service API. ☆20 Updated 11 months ago
- ☆623 Updated last year
- [ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference ☆188 Updated last year
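- Go Out: the first step in going global is getting your toolkit in order. A tech stack and tool collection for independent developers going global, with useful tools and resources to help you better understand and enter overseas markets. Selection criteria: helps independent developers work more efficiently; helps independent developers reduce costs; sufficiently popular in the market. ☆22 Updated last year
- A small tool for improving the accuracy of WeChat QR code recognition ☆102 Updated 5 months ago
- A curated column of open-source GitHub projects, updated irregularly ☆238 Updated last year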
- [NeurIPS 2025 spotlight] QFFT, Question-Free Fine-Tuning for Adaptive Reasoning ☆91 Updated 3 weeks ago
- A Fully Self-Hosted Solution for Full-Duplex Voice Interaction ☆254 Updated 2 weeks ago
- A third-party React-based web back-end service for XXL-JOB-PANEL-R3. ☆99 Updated last month
- A C++ message push server for WeCom (Enterprise WeChat) ☆37 Updated 2 years ago
- A complete CMake tutorial. It consists of a series of step-by-step tasks that introduce CMake and show how to achieve specific goals. ☆276 Updated 4 years ago
- An LLM semantic caching system aiming to enhance user experience by reducing response time via cached query-result pairs. ☆952 Updated 3 months ago
- Accelerate inference without tears ☆333 Updated 2 weeks ago
- Triton Documentation in Simplified Chinese (Triton 中文文档) ☆85 Updated 5 months ago
- A single RPC framework based on Go ☆19 Updated 2 years ago
- A compilation of the GitHub addresses and popular open-source projects of major tech companies, to help you learn about China's open-source ecosystem more efficiently ☆105 Updated 3 months ago
- ☆319 Updated 3 months ago
- Python port of Moses tokenizer, truecaser and normalizer ☆107 Updated 2 years ago
- OK Computer in a Box: Your Self-Hosted Agent Workflow Layer ☆35 Updated last week
- PyTorch implementations of recommender system algorithms ☆76 Updated last year
- The Plug-and-Play Go Microservices Framework, Unleash the Power of Simplicity: With the Plug-and-Play Go Microservices Framework, we tran… ☆123 Updated last week
- ☆63 Updated last month
- ☆174 Updated this week
- A lightweight LLaMA-like LLM inference framework based on Triton kernels. ☆157 Updated 3 weeks ago
- ☆79 Updated last year
- This repository organizes materials, recordings, and schedules related to AI-infra learning meetings. ☆178 Updated 3 weeks ago